JANUARY 1947

THE VARIANCE OF THE OVERLAP OF GEOMETRICAL FIGURES
WITH REFERENCE TO A BOMBING PROBLEM

BY F. GARWOOD, PH.D.


1. Introduction

The present paper deals with a particular problem arising in the mathematical study of
bombing. Briefly, the general problem is that of predicting the over-all effects of a bombing
attack carried out under given conditions against a given target, and the mathematical
treatment involves various simplifying assumptions concerning these conditions.

In the type of problem considered here, attention is centred on the total plan area of
damage caused to a single building by bombs falling independently and at random over a
larger area containing the building. It is assumed that each bomb damages all that part of
the building contained within a circle of fixed size centred at the bomb (a square damage
area is also considered), while the building has a simple plan outline, such as a rectangle or
a circle. The area of damage of two or more adjacent bombs is merely the area covered by
the circles. The theoretical problems dealt with are those of estimating the variance of the
amount of damaged area (the estimation of the mean or expected damage presents little
difficulty). It would be more satisfactory to obtain the complete frequency distributions,
but this has so far not been achieved, nor has it been possible to obtain explicit formulae for
the 3rd and 4th moments.

As there may be applications of the problems to fields completely different from those of
bombing studies, and as they are problems which involve essentially the concepts of geometry
and of probability, it is convenient to express them entirely in these terms.

We thus have problems of the following type. A number of circles are placed at random on
a plane so that each one has some or all of its area inside a fixed square. What are the mean
and variance of the area of the square covered by the circles? The fundamentals of this type
of problem have been studied by Robbins (1944), and Bronowski & Neyman have
dealt with another particular case.* Robbins's results enable us to deal with geometrical
figures other than circles and squares, and also to deal with cases where the number,
position and orientation of the 'covering' figures follow probability laws other than the
simple ones implied in the above example.


2. Robbins's theorem

In leading up to his theorem, Robbins uses the concept of a random measurable subset X
of n-dimensional Euclidean space $E_n$. He defines the function $g(x, X)$ for every point x
of $E_n$ and for every X as equal to 1 for $x \in X$ and zero elsewhere. The theorem is then as
follows:

Let X be a random Lebesgue measurable subset of $E_n$, with measure $\mu(X)$. For any point
x of $E_n$ let $p(x) = \Pr\{x \in X\}$. Then, assuming that the function $g(x, X)$ is a measurable function
of the pair (x, X), the expected value of the measure of X will be given by the Lebesgue integral of
the function $p(x)$ over $E_n$.

* Note by Editor. This paper was received for publication in September 1945; Dr Garwood has
asked me to add the following note in proof. 'The author had the privilege of seeing the work of
Bronowski & Neyman in proof. This paper was then submitted, after which their work was published
(1945) together with a second article by Robbins (1945), who has solved, among others, some of the
problems dealt with in this paper, as acknowledged in later footnotes.'

Robbins generalizes this result to obtain the mth moment of the measure of X; this is the
integral of the function $p(x_1, x_2, \ldots, x_m)$ over the m-fold product of $E_n$, where

\[ p(x_1, x_2, \ldots, x_m) = \Pr\{x_1 \in X \text{ and } \ldots \text{ and } x_m \in X\}. \tag{1} \]

It is useful to give a simple non-rigorous proof of this result. Suppose the space $E_n$ to be
divided into an enumerably infinite set of small elements $\omega_1, \omega_2, \ldots$. If we assume that any
particular subset X can be made up of a selection of the $\omega$'s, then

\[ \mu(X) = \lambda_1\omega_1 + \lambda_2\omega_2 + \ldots, \]

where $\omega_i$ is here used also as the measure of the element $\omega_i$, and where the $\lambda$'s, appropriate
to this particular X, are 0 or 1. Hence

\[ \{\mu(X)\}^m = \sum_p \sum_q \ldots \lambda_p \lambda_q \ldots \omega_p \omega_q \ldots, \]

where each summation is over the whole of $E_n$ and there are m such summations. Thus,
writing E for expectation,

\[ \mathrm{E}\{\mu(X)\}^m = \sum_p \sum_q \ldots \omega_p \omega_q \ldots \mathrm{E}(\lambda_p \lambda_q \ldots). \]

But the expectation of $\lambda_p \lambda_q \ldots$ is the probability that the elements $\omega_p, \omega_q, \ldots$ are all in X, and
on proceeding to the limit the desired result is obtained.

The verification of this result in the case, say, of the 2nd moment of a linearly distributed
variate, is instructive. Thus suppose x is a variate with a probability function F(x), i.e. the
probability of obtaining a value less than x is given by the measurable function F(x), where

\[ F(-\infty) = 0 \quad\text{and}\quad F(\infty) = 1. \]

Define X as the interval from 0 to x; then the expectation of the square of the measure of
X is the 2nd moment of x. To use Robbins's theorem we need
the probability $p(x_1, x_2)$ that a given pair of values $x_1$ and $x_2$
both lie in the interval 0, x chosen at random. Using co-ordinate
axes $Ox_1$, $Ox_2$, this probability is zero in the 2nd and
4th quadrants, since 0, x cannot contain two points $x_1$ and $x_2$
of opposite signs. In the region A (see Fig. 1), where $x_1 > x_2 > 0$,
the two values $x_1$ and $x_2$ are both in 0, x if $x > x_1$, and the
probability of this is $1 - F(x_1)$. Thus in A,

\[ p(x_1, x_2) = 1 - F(x_1). \]

Similarly in B, $p(x_1, x_2) = 1 - F(x_2)$,
while in C, $p(x_1, x_2) = F(x_1)$,
and in D, $p(x_1, x_2) = F(x_2)$.

The integral of $p(x_1, x_2)$ over A is seen to be

\[ \int_0^\infty x_1[1 - F(x_1)]\,dx_1, \]

while the total integral of $p(x_1, x_2)$ over the whole plane is

\[ 2\int_0^\infty x[1 - F(x)]\,dx - 2\int_{-\infty}^0 xF(x)\,dx. \]

A single integration by parts then leads to

\[ \int_{-\infty}^{\infty} x^2\,dF(x), \]

which is the 2nd moment of x, as required.
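
The identity just verified can also be checked numerically. The sketch below is illustrative only: it assumes a normal variate (mean 0.5, standard deviation 1, both arbitrary choices, not taken from the paper) and uses a simple composite Simpson rule to compare the two one-dimensional integrals with the known 2nd moment.

```python
import math

def normal_cdf(x, mu, sigma):
    """F(x) for a normal variate (an assumed example distribution)."""
    return 0.5 * (1.0 + math.erf((x - mu) / (sigma * math.sqrt(2.0))))

def simpson(f, lo, hi, n=2000):
    """Composite Simpson's rule with n (even) sub-intervals."""
    h = (hi - lo) / n
    total = f(lo) + f(hi)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(lo + i * h)
    return total * h / 3.0

def second_moment_from_plane_integral(cdf, bound=12.0):
    """2*int_0^inf x[1-F(x)]dx - 2*int_-inf^0 x F(x)dx, truncated at +/-bound."""
    positive = simpson(lambda x: x * (1.0 - cdf(x)), 0.0, bound)
    negative = simpson(lambda x: x * cdf(x), -bound, 0.0)
    return 2.0 * positive - 2.0 * negative

mu, sigma = 0.5, 1.0                       # hypothetical parameters
estimate = second_moment_from_plane_integral(lambda x: normal_cdf(x, mu, sigma))
exact = mu ** 2 + sigma ** 2               # 2nd moment about the origin
print(estimate, exact)
```

The truncation bound of 12 is an assumption that suits this particular distribution; a heavier-tailed F would need a wider range.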
3. Application of Robbins's theorem to overlap problems

We shall be concerned with cases in $E_2$, where the subset X is the part of a fixed area A in the
plane which is covered by a number of areas C dropped independently and at random on the
plane. We suppose A to be the interior of a simple closed curve, while each C is the interior
of another curve. The area C has a reference point Q (conveniently called its centre) and a
reference line, and it is assumed that there is a frequency distribution $f(x, y, \theta)$ of the
position (x, y) of Q and of the inclination $\theta$ of the reference line to a fixed direction in $E_2$.
$f(x, y, \theta)$ can be assumed to be zero outside an area T, i.e. the points Q are distributed inside
T. (In the applications the angle $\theta$ will be constant and the areas C will be equally likely to
fall anywhere over T, so that we can write $f(x, y, \theta) = 1/T$.) Another chance variable is k,
the number of areas C; its distribution can be defined by the series $p_0, p_1, p_2, \ldots$
(which are the probabilities of 0, 1, 2, ... C's falling on T), or by the probability
generating function G(u), where

\[ G(u) = \sum_{k=0}^{\infty} p_k u^k. \tag{2} \]

Finally, it is more convenient to consider the moments of the area Y = A - X, i.e. the area
of A (we can use the symbols Y, etc. for either the sets or their areas) not covered by the C's.
Evidently the variances of X and Y are equal.

To obtain the 1st moment of Y, we need first the probability $p(x_1, y_1)$ that a point $(x_1, y_1)$
of A will belong to Y, i.e. of $(x_1, y_1)$ not being covered by a C. Now $(x_1, y_1)$ will not be covered
by a particular C falling at an inclination $\theta$ if the centre Q(x, y) falls outside an area
$C(x_1, y_1, \theta)$ obtained by centring the C at $(x_1, y_1)$ and rotating it through 180°. If the part
of T exterior to this area is called $T - C(x_1, y_1, \theta)$, and if we allow all inclinations, the
probability of this occurring is

\[ q(x_1, y_1) = \iiint_{T - C(x_1, y_1, \theta)} f(x, y, \theta)\,dx\,dy\,d\theta. \tag{3} \]

If k C's are dropped independently, the probability is given by $q^k(x_1, y_1)$, so that the total
probability of $(x_1, y_1)$ belonging to Y is

\[ p(x_1, y_1) = \sum_{k=0}^{\infty} p_k q^k(x_1, y_1) = G\{q(x_1, y_1)\}. \tag{4} \]

The 1st moment of Y is thus, in the case of k C's,

\[ \mu_1'(Y) = \iint_A q^k(x_1, y_1)\,dx_1\,dy_1, \tag{5} \]

and in the general case

\[ \mu_1'(Y) = \iint_A G\{q(x_1, y_1)\}\,dx_1\,dy_1. \tag{6} \]

The 2nd moment is obtained by a similar process; we require the probability $p(x_1, y_1, x_2, y_2)$
that neither of two points $(x_1, y_1)$ and $(x_2, y_2)$ is covered by a C. Corresponding to these two
points and an inclination $\theta$ the permissible region in which each centre Q can fall is

\[ T - C(x_1, y_1, \theta) - C(x_2, y_2, \theta). \]

Thus for one C the probability is

\[ q(x_1, y_1, x_2, y_2) = \iiint_{T - C_1 - C_2} f(x, y, \theta)\,dx\,dy\,d\theta, \tag{7} \]

giving $p(x_1, y_1, x_2, y_2) = q^k(x_1, y_1, x_2, y_2)$ in the case of k C's, and
$p(x_1, y_1, x_2, y_2) = G\{q(x_1, y_1, x_2, y_2)\}$ in the general case. Thus the 2nd moment is

\[ \mu_2'(Y) = \iint_A \iint_A q^k(x_1, y_1, x_2, y_2)\,dx_1\,dy_1\,dx_2\,dy_2 \quad\text{for } k \text{ C's} \tag{8} \]

and

\[ \mu_2'(Y) = \iint_A \iint_A G\{q(x_1, y_1, x_2, y_2)\}\,dx_1\,dy_1\,dx_2\,dy_2 \tag{9} \]

in the general case. In general, for the mth moment, the probability that $(x_1, y_1), \ldots, (x_m, y_m)$
are not covered by k C's is $q^k(x_1, y_1, x_2, y_2, \ldots, x_m, y_m)$, where

\[ q(x_1, y_1, x_2, y_2, \ldots, x_m, y_m) = \iiint_{T - C_1 - C_2 \ldots - C_m} f(x, y, \theta)\,dx\,dy\,d\theta \tag{10} \]

and $T - C_1 - C_2 \ldots - C_m$ is the area of T outside C's centred at $(x_1, y_1), (x_2, y_2), \ldots, (x_m, y_m)$
and rotated through 180°.

Thus the mth moment is equal to

\[ \mu_m'(Y) = \iint_A \cdots \iint_A q^k(x_1, y_1, \ldots, x_m, y_m)\,dx_1\,dy_1 \ldots dx_m\,dy_m \]

in the case of k C's, or

\[ \mu_m'(Y) = \iint_A \cdots \iint_A G\{q(x_1, y_1, \ldots, x_m, y_m)\}\,dx_1\,dy_1 \ldots dx_m\,dy_m \tag{11} \]

in the general case.

4. Uniform distribution of covering areas at constant inclination

As mentioned above, in the cases with which we shall be dealing, the areas C are equally
likely to fall anywhere over T, and the angle $\theta$ is constant. The function $f(x, y, \theta)$ can be put
equal to 1/T for points of T and zero outside; the variable $\theta$, and integration with respect
to it, may be omitted.

The function $q(x_1, y_1)$ is the fraction of T not covered by a C centred at $(x_1, y_1)$ rotated
through 180°, and in general $q(x_1, y_1, \ldots, x_m, y_m)$ is the fraction of T outside m C's
centred at $(x_1, y_1), (x_2, y_2), \ldots, (x_m, y_m)$ rotated through 180°.

Instead of the variate Y we can consider Y/A, i.e. the fraction of A not covered by k C's,
and to obtain its mth moment we divide $\mu_m'(Y)$ by $A^m$. Also the quantity $dx_1\,dy_1 \ldots dx_m\,dy_m / A^m$
is the probability of obtaining m centres in the elements of area $dx_1\,dy_1, \ldots, dx_m\,dy_m$ of A
if these centres are uniformly distributed over A.

We thus obtain the following result from (11): the mth moment of the fraction of A not
covered by k C's with their centres falling at random on T is equal to the kth moment of the
fraction of T not covered by m C's with their centres falling at random on A and rotated through
180°.

In the case k = m = 1 we can express this in a slightly different way if we (i) deal with the
area common to the two areas concerned, (ii) regard all orientations as possible and as
equally likely, and (iii) deal with areas rather than fractions. We obtain, in fact, the following
result: the integral of the overlap of C and A, when the centre of C is taken over T and all
orientations are permitted, is equal to the corresponding integral of the overlap of C and T, for
all positions of the centre of C on A and for all orientations.

In the practical cases with which we shall deal, the area A is always 'well inside' T, i.e.
every point of A can be reached by a C centred somewhere in T. In such cases the formula
for the mean overlap is simple; we have m = 1 and the fraction of T not covered by one C
is (T - C)/T, which is constant for all $(x_1, y_1)$, so that its kth moment is $(T - C)^k/T^k$, i.e.

\[ \mu_1'(Y/A) = \left(\frac{T - C}{T}\right)^k. \tag{12} \]

If the number of C's follows a probability generating function G(u), the mean is given by

\[ \mu_1'(Y/A) = G\left(\frac{T - C}{T}\right). \tag{13} \]

For the 2nd moment we are concerned with two C's centred at $(x_1, y_1)$ and $(x_2, y_2)$; if
their common area is $\Omega(x_1, y_1, x_2, y_2)$ we have

\[ q(x_1, y_1, x_2, y_2) = \frac{T - 2C + \Omega(x_1, y_1, x_2, y_2)}{T}. \tag{14} \]

The 2nd moment $\mu_2'(Y/A)$ is then the expectation of $q^k$ or G(q) for all pairs of points over A,
and we no longer have a simple formula as in the case of the 1st moment. The overlap $\Omega$,
however, depends on the relative positions of the two C's, and therefore the number of
variables in the integration is reduced from 4 to 2 or 1. This is illustrated in the following
examples.
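
As a small illustration of (12) and (13), the sketch below evaluates the mean uncovered fraction for a fixed number of areas and for a count described by a generating function. The numerical values of T, C and the density of centres are hypothetical, and the Poisson generating function is only one assumed choice of G.

```python
import math

def mean_uncovered_fixed_k(T, C, k):
    """Equation (12): mean fraction of A left uncovered by exactly k areas."""
    return ((T - C) / T) ** k

def mean_uncovered_general(pgf, T, C):
    """Equation (13): apply the generating function G to (T - C)/T."""
    return pgf((T - C) / T)

def poisson_pgf(mean_count):
    """Generating function of a Poisson count (an assumed example of G)."""
    return lambda u: math.exp(-mean_count * (1.0 - u))

T, C = 2.0, 0.5                      # hypothetical areas
fixed = mean_uncovered_fixed_k(T, C, 3)

lam = 1.2                            # hypothetical mean number of centres per unit area
poisson = mean_uncovered_general(poisson_pgf(lam * T), T, C)
print(fixed, poisson)
```

With the Poisson choice the area T cancels, so the mean depends only on the product of the density and C.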

5. Circles falling on a fixed square

Assume A to be a square of unit side (i.e. A = 1), C a circle of radius a and T a 'square with
rounded corners', whose boundary is at a distance a outside the sides of A. Thus

\[ T = 1 + 4a + \pi a^2 \tag{15} \]

and

\[ C = \pi a^2. \tag{16} \]

It is seen that the fraction q of T outside the two circles with centres $(x_1, y_1)$ and $(x_2, y_2)$ is a
function only of the distance r between these points, and can therefore be written as q(r). Hence
if $\phi(r)$ is the frequency function of r, we obtain

\[ \mu_2'(Y) = \int_0^{\sqrt 2} q^k(r)\,\phi(r)\,dr \tag{17} \]

or

\[ \mu_2'(Y) = \int_0^{\sqrt 2} G\{q(r)\}\,\phi(r)\,dr. \tag{18} \]

The area $\Omega(r)$ common to two circles of radius a with centres distant r apart is

\[ \Omega(r) = 2a^2(\theta - \sin\theta\cos\theta), \tag{19} \]

where

\[ r = 2a\cos\theta \quad (r < 2a), \tag{20} \]

and

\[ \Omega(r) = 0 \quad (r \ge 2a), \tag{21} \]

and

\[ q(r) = \frac{1 + 4a - \pi a^2 + \Omega(r)}{1 + 4a + \pi a^2}, \tag{22} \]

where $\Omega(r)$ is given by (19), (20) and (21).
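
The overlap of two equal circles and the resulting fraction q(r) are easy to evaluate directly. The following is a minimal sketch of (19) to (22) for the unit square; the function names and the radius 0.25 are illustrative choices, not part of the paper.

```python
import math

def lens_area(r, a):
    """Area common to two circles of radius a with centres r apart, (19)-(21)."""
    if r >= 2.0 * a:
        return 0.0
    theta = math.acos(r / (2.0 * a))          # from r = 2a cos(theta)
    return 2.0 * a * a * (theta - math.sin(theta) * math.cos(theta))

def uncovered_fraction(r, a):
    """Equation (22): fraction of T outside two circles whose centres are r apart."""
    T = 1.0 + 4.0 * a + math.pi * a * a       # (15)
    return (1.0 + 4.0 * a - math.pi * a * a + lens_area(r, a)) / T

a = 0.25                                      # hypothetical radius
T = 1.0 + 4.0 * a + math.pi * a * a
C = math.pi * a * a
print(lens_area(0.0, a), uncovered_fraction(0.0, a), uncovered_fraction(1.0, a))
```

At r = 0 the two circles coincide, so q reduces to (T - C)/T; for r of 2a or more it reduces to (T - 2C)/T.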

To obtain the frequency function $\phi(r)$ of r, we note that

\[ r^2 = (x_1 - x_2)^2 + (y_1 - y_2)^2, \tag{23} \]

where $x_1$, $x_2$, $y_1$ and $y_2$ are uniformly and independently distributed in the range 0, 1. The
difference $\xi = |x_1 - x_2|$ follows the 'triangular' distribution,

\[ df = 2(1 - \xi)\,d\xi, \tag{24} \]

from $\xi = 0$ to 1, so that the quantity $u = \xi^2 = (x_1 - x_2)^2$ follows the distribution

\[ df = \frac{1 - \sqrt u}{\sqrt u}\,du. \tag{25} \]

Similarly, $v = (y_1 - y_2)^2$ follows independently the same distribution

\[ df = \frac{1 - \sqrt v}{\sqrt v}\,dv. \tag{26} \]

The distribution of $r = \sqrt{u + v}$ is obtained by integrating the product

\[ \frac{(1 - \sqrt u)(1 - \sqrt v)}{\sqrt{uv}} \tag{27} \]

of the frequencies of u and v over that part of the line $u + v = r^2$ within the square of unit
side in which the point u, v can lie, and we obtain without much difficulty for the frequency
function of r,

\[ \phi(r) = 2r(\pi - 4r + r^2) \quad\text{for } 0 < r < 1, \tag{28} \]

and

\[ \phi(r) = 2r\{4\sin^{-1}(1/r) + 4\sqrt{r^2 - 1} - r^2 - \pi - 2\} \quad\text{for } 1 < r < \sqrt 2. \tag{29} \]
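
A quick numerical check of (28) and (29) is to confirm that the frequency function integrates to unity and to compute the mean distance between two random points of the unit square. The sketch below uses a plain composite Simpson rule, integrating separately on either side of r = 1; the quadrature routine and step count are assumptions of this example.

```python
import math

def phi_square(r):
    """Frequency function of the distance between two random points in the unit square."""
    if r <= 0.0 or r >= math.sqrt(2.0):
        return 0.0
    if r <= 1.0:
        return 2.0 * r * (math.pi - 4.0 * r + r * r)                      # (28)
    root = math.sqrt(max(r * r - 1.0, 0.0))
    return 2.0 * r * (4.0 * math.asin(1.0 / r) + 4.0 * root
                      - r * r - math.pi - 2.0)                            # (29)

def simpson(f, lo, hi, n=2000):
    """Composite Simpson's rule with n (even) sub-intervals."""
    h = (hi - lo) / n
    total = f(lo) + f(hi)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(lo + i * h)
    return total * h / 3.0

top = math.sqrt(2.0)
total_probability = simpson(phi_square, 0.0, 1.0) + simpson(phi_square, 1.0, top)
mean_distance = (simpson(lambda r: r * phi_square(r), 0.0, 1.0)
                 + simpson(lambda r: r * phi_square(r), 1.0, top))
print(total_probability, mean_distance)
```

The mean distance obtained this way should agree with the known closed form $(2 + \sqrt 2 + 5\ln(1 + \sqrt 2))/15$.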

Thus the 2nd moment of the fraction of the unit square uncovered is given by the integral
(17) or (18), where q(r) is given by (22), (19), (20) and (21) and $\phi(r)$ is given by (28) and (29).

It does not appear possible to reduce the integral (17) simply to elementary functions, and
quadrature must be used. The integrand has discontinuities in its first derivative at r = 1
and r = 2a, so that the integration must be carried out separately over the intervals with
these as end-points.
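
The quadrature described above can be sketched as follows for a fixed number k of circles. This is an illustrative implementation, not the method used for the paper's tables: it assumes a composite Simpson rule and splits the range at r = 1 and r = 2a as the text recommends. The variance is the 2nd moment (17) less the square of the mean (12).

```python
import math

def lens_area(r, a):
    """Overlap of two circles of radius a with centres r apart, (19)-(21)."""
    if r >= 2.0 * a:
        return 0.0
    theta = math.acos(r / (2.0 * a))
    return 2.0 * a * a * (theta - math.sin(theta) * math.cos(theta))

def phi_square(r):
    """Distance distribution in the unit square, (28)-(29)."""
    if r <= 0.0 or r >= math.sqrt(2.0):
        return 0.0
    if r <= 1.0:
        return 2.0 * r * (math.pi - 4.0 * r + r * r)
    root = math.sqrt(max(r * r - 1.0, 0.0))
    return 2.0 * r * (4.0 * math.asin(1.0 / r) + 4.0 * root - r * r - math.pi - 2.0)

def simpson(f, lo, hi, n=2000):
    h = (hi - lo) / n
    total = f(lo) + f(hi)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(lo + i * h)
    return total * h / 3.0

def variance_fixed_k(a, k):
    """Variance of the uncovered fraction of the unit square for k circles."""
    T = 1.0 + 4.0 * a + math.pi * a * a
    C = math.pi * a * a
    q = lambda r: (T - 2.0 * C + lens_area(r, a)) / T          # (22)
    top = math.sqrt(2.0)
    cuts = sorted({0.0, min(2.0 * a, top), 1.0, top})          # break at r = 1 and r = 2a
    second = sum(simpson(lambda r: q(r) ** k * phi_square(r), lo, hi)
                 for lo, hi in zip(cuts, cuts[1:]))            # (17)
    mean = ((T - C) / T) ** k                                  # (12)
    return second - mean ** 2

a = math.sqrt(0.2 / math.pi)          # circle of area 0.2
var_one_circle = variance_fixed_k(a, 1)
print(var_one_circle)
```

For a single circle of area 0.2 this gives a variance of about 0.005.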


6. Overlap of circles on fixed rectangle*

* This problem was solved by Robbins (1945); see footnote on p. 1.

We replace the square A of the previous section by a rectangle A; for convenience we assume
its sides to be $\sqrt b$ and $1/\sqrt b$, where b > 1, so that the ratio of the longer to the shorter side is
b and the area is unity. The centres of the circles of radius a are assumed to be equally likely to
fall anywhere in a 'rectangle with rounded corners' T, whose boundary is at a distance a
outside A, i.e.

\[ T = 1 + \pi a^2 + 2a(\sqrt b + 1/\sqrt b). \tag{30} \]

To obtain the 2nd moment of the fraction of the area of A not covered, we calculate an
integral similar to (17) or (18). The function

\[ q(r) = \frac{T - 2\pi a^2 + \Omega(r)}{T} \]

is derived from $\Omega(r)$, which remains the same, but the frequency distribution $\phi(r)$ of the
distance between a pair of points chosen at random in the rectangle is different.

The co-ordinates $x_1$ and $x_2$ are uniformly and independently distributed in the range 0,
$\sqrt b$ (if we take Ox parallel to the longer side). The distribution of $u = (x_1 - x_2)^2$ is thus seen
from (25) to be

\[ df = \frac{1 - \sqrt{u/b}}{\sqrt{bu}}\,du, \]

while the distribution of $v = (y_1 - y_2)^2$ is

\[ df = \frac{1 - \sqrt{bv}}{\sqrt{v/b}}\,dv. \]

The distribution of $r = \sqrt{u + v}$ is obtained by integrating the product

\[ \frac{(1 - \sqrt{u/b})(1 - \sqrt{bv})}{\sqrt{uv}} \]

over that part of the line $u + v = r^2$ within the rectangle $0 \le u \le b$, $0 \le v \le 1/b$. This gives

\[ \phi(r) = \phi_1(r) = 2r[\pi - 2r(\sqrt b + 1/\sqrt b) + r^2] \quad\text{for } r < 1/\sqrt b, \tag{33} \]

\[ \phi(r) = \phi_2(r) = 2r[2\alpha - 1/b - 2r\sqrt b\,(1 - \cos\alpha)] \quad\text{for } 1/\sqrt b < r < \sqrt b, \tag{34} \]

where $\alpha = \sin^{-1}(1/r\sqrt b)$, and

\[ \phi(r) = \phi_3(r) = 2r[2(\alpha - \beta) - b - 1/b + 2r\sin\beta/\sqrt b + 2r\sqrt b\cos\alpha - r^2] \quad\text{for } \sqrt b < r < \sqrt{b + 1/b}, \tag{35} \]

where $\beta = \cos^{-1}(\sqrt b/r)$.

Thus the 2nd moment of Y can be found from (19)-(21) together with (30) and (33)-(35).
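
The three branches (33) to (35) can be checked numerically in the same way as for the square. The sketch below assumes the forms $\alpha = \sin^{-1}(1/r\sqrt b)$ and $\beta = \cos^{-1}(\sqrt b/r)$, integrates each branch separately with a composite Simpson rule, and confirms that the total probability is unity for the 2 x 1 and 4 x 1 shapes (b = 2 and b = 4).

```python
import math

def phi_rectangle(r, b):
    """Distance distribution for a unit-area rectangle with sides sqrt(b) and 1/sqrt(b)."""
    long_side = math.sqrt(b)
    short_side = 1.0 / long_side
    if r <= 0.0 or r >= math.sqrt(b + 1.0 / b):
        return 0.0
    if r <= short_side:
        return 2.0 * r * (math.pi - 2.0 * r * (long_side + short_side) + r * r)   # (33)
    alpha = math.asin(min(1.0, short_side / r))
    if r <= long_side:
        return 2.0 * r * (2.0 * alpha - 1.0 / b
                          - 2.0 * r * long_side * (1.0 - math.cos(alpha)))        # (34)
    beta = math.acos(min(1.0, long_side / r))
    return 2.0 * r * (2.0 * (alpha - beta) - b - 1.0 / b
                      + 2.0 * r * math.sin(beta) / long_side
                      + 2.0 * r * long_side * math.cos(alpha) - r * r)            # (35)

def simpson(f, lo, hi, n=2000):
    h = (hi - lo) / n
    total = f(lo) + f(hi)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(lo + i * h)
    return total * h / 3.0

def total_probability(b):
    cuts = [0.0, 1.0 / math.sqrt(b), math.sqrt(b), math.sqrt(b + 1.0 / b)]
    return sum(simpson(lambda r: phi_rectangle(r, b), lo, hi)
               for lo, hi in zip(cuts, cuts[1:]))

totals = {b: total_probability(b) for b in (2.0, 4.0)}
print(totals)
```

Setting b = 1 makes the middle range empty and reduces the last branch to the square's formula (29), which gives a further consistency check.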


7. Overlap of rectangles on a fixed rectangle

Assume that the fixed rectangle A has sides a and b and the covering rectangles C have sides
$\alpha$ and $\beta$. The latter are assumed to be dropped with sides $\alpha$ parallel to the side a and with
their centres anywhere inside the rectangle T, which is concentric with A and has sides
$a + \alpha$ and $b + \beta$. To calculate the 2nd moment of the fraction Y/A of A not covered by k C's,
we use (18) and calculate the expectation of $q^k(x_1, y_1, x_2, y_2)$, where q is the fraction of T not covered
by two C's with their centres $(x_1, y_1)$ and $(x_2, y_2)$ falling at random in A. The area common to
two C's is readily seen to depend only on the difference $\xi$ of the x co-ordinates of their
centres and on the similar difference $\eta$ of their y co-ordinates. In fact, the area can be
written as

\[ \Omega(\xi, \eta) = [\alpha - \xi][\beta - \eta], \tag{36} \]

where the symbol* [x] stands for x when x > 0 and is zero when x < 0, and we obtain

\[ q(x_1, y_1, x_2, y_2) = 1 + \frac{[\alpha - \xi][\beta - \eta] - 2\alpha\beta}{(a + \alpha)(b + \beta)}. \tag{37} \]
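
The bracket notation of (36) is simply the positive part of its argument. A minimal sketch, with hypothetical dimensions, computes the overlap of two parallel rectangles from the centre differences and compares it with a direct interval-intersection calculation.

```python
def positive_part(x):
    """The symbol [x]: x when x > 0, otherwise zero."""
    return x if x > 0 else 0.0

def rectangle_overlap(alpha, beta, xi, eta):
    """Equation (36): area common to two alpha-by-beta rectangles with parallel
    sides whose centres differ by xi in x and eta in y."""
    return positive_part(alpha - abs(xi)) * positive_part(beta - abs(eta))

def uncovered_fraction(a, b, alpha, beta, xi, eta):
    """Equation (37): fraction of T (sides a + alpha, b + beta) outside both rectangles."""
    T = (a + alpha) * (b + beta)
    return 1.0 + (rectangle_overlap(alpha, beta, xi, eta) - 2.0 * alpha * beta) / T

def overlap_by_intervals(alpha, beta, x1, y1, x2, y2):
    """Independent check: intersect the x and y extents of the two rectangles."""
    width = min(x1, x2) + alpha / 2.0 - (max(x1, x2) - alpha / 2.0)
    height = min(y1, y2) + beta / 2.0 - (max(y1, y2) - beta / 2.0)
    return max(width, 0.0) * max(height, 0.0)

alpha, beta = 2.0, 1.0                       # hypothetical covering rectangle
direct = rectangle_overlap(alpha, beta, 0.5, 0.25)
check = overlap_by_intervals(alpha, beta, 0.0, 0.0, 0.5, 0.25)
print(direct, check, uncovered_fraction(3.0, 2.0, alpha, beta, 0.5, 0.25))
```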


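To obtain the expectation of $q^k$ we need the frequency distribution of $\xi$ and $\eta$. As in §5,
$\xi$ is readily seen to follow the frequency distribution

\[ df = \frac{2}{a^2}(a - \xi)\,d\xi \tag{38} \]

between 0 and a, with a similar distribution for $\eta$, and we obtain the result

\[ \mu_2'(Y/A) = \frac{4}{a^2b^2}\int_0^a\!\!\int_0^b \left\{1 + \frac{[\alpha - \xi][\beta - \eta] - 2\alpha\beta}{(a + \alpha)(b + \beta)}\right\}^k (a - \xi)(b - \eta)\,d\xi\,d\eta. \tag{39} \]

If the kth power be expanded, the resulting integrals are, with the exception of the first, the
product of integrals whose upper limits are a' = min (a, $\alpha$) and b' = min (b, $\beta$) respectively.
We obtain

\[ \mu_2'(Y/A) = \frac{4}{a^2b^2}\left[\int_0^{a'}\!\!\int_0^{b'} \left\{1 + \frac{(\alpha - \xi)(\beta - \eta) - 2\alpha\beta}{(a + \alpha)(b + \beta)}\right\}^k (a - \xi)(b - \eta)\,d\xi\,d\eta \right. \]
\[ \left. {} + \left\{1 - \frac{2\alpha\beta}{(a + \alpha)(b + \beta)}\right\}^k \left\{\int_0^a\!\!\int_0^b (a - \xi)(b - \eta)\,d\xi\,d\eta - \int_0^{a'}\!\!\int_0^{b'} (a - \xi)(b - \eta)\,d\xi\,d\eta\right\}\right]. \tag{40} \]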


* The writer is indebted to Neyman & Bronowski for this convenient notation (see below). 
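By a simple change of variable, $u = \alpha - \xi$ and $v = \beta - \eta$, we obtain

\[ \mu_2'(Y/A) = \frac{4}{a^2b^2}\left[\int_{[\alpha - a]}^{\alpha}\!\!\int_{[\beta - b]}^{\beta} \left\{1 + \frac{uv - 2\alpha\beta}{(a + \alpha)(b + \beta)}\right\}^k (u + a - \alpha)(v + b - \beta)\,du\,dv \right. \]
\[ \left. {} + \left\{1 - \frac{2\alpha\beta}{(a + \alpha)(b + \beta)}\right\}^k \left\{\tfrac14 a^2b^2 - a'b'(a - \tfrac12 a')(b - \tfrac12 b')\right\}\right], \tag{41} \]

which is the result obtained by Bronowski & Neyman by a rather different method.*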

8. Overlap of circles on a fixed circle

We now consider a fixed circle A of unit area and therefore of radius $b = 1/\sqrt\pi$, with k circles
C of radius a dropped at random with their centres uniformly distributed over a circle T
of radius a + b. The 2nd moment of the fraction Y/A of A not covered is, as in §6, the
expectation of $q^k(r)$, where q(r) is the fraction of T not covered by two circles with centres
falling at random in A a distance r apart. We have

\[ q(r) = \frac{1 + 2a\sqrt\pi - \pi a^2 + \Omega(r)}{1 + 2a\sqrt\pi + \pi a^2}, \tag{42} \]

where $\Omega(r)$, the overlap of two circles of radius a with centres r apart, is given by (19), (20)
and (21) as before. We thus need the frequency distribution $\phi(r)$ of the distance between two
points chosen at random in the circle of unit area to obtain

\[ \mu_2'(Y/A) = \int_0^{2b} q^k(r)\,\phi(r)\,dr. \tag{43} \]

To do this we use a fairly straightforward geometrical method, finding first the probability
integral

\[ F(r) = \int_0^r \phi(r)\,dr, \tag{44} \]

which is the probability that the distance between the two random points is less than r.

The probability that the first point is between v and v + dv from the centre is $2v\,dv/b^2$,
while if

\[ A(v) = \text{area common to circles radii } b \text{ and } r \text{ with centres distance } v \text{ apart}, \tag{45} \]

it follows that the probability of the second point being within r of the first is $A(v)/\pi b^2$. Hence

\[ F(r) = \int_0^b \frac{A(v)}{\pi b^2}\,\frac{2v}{b^2}\,dv. \tag{46} \]

Construct the triangle with sides r, b and v, and let the angles opposite to these be $\theta$, $\phi$
and $\psi$. Then the following can be readily verified:

If r < b,
\[ A(v) = b^2\theta + r^2\phi - br\sin\psi \quad\text{if } b - r < v < b, \qquad A(v) = \pi r^2 \quad\text{if } 0 < v < b - r. \tag{47} \]

If b < r < 2b,
\[ A(v) = b^2\theta + r^2\phi - br\sin\psi \quad\text{if } r - b < v < b, \qquad A(v) = \pi b^2 \quad\text{if } 0 < v < r - b. \tag{48} \]

The integration in (46) is carried out by parts, with $\psi$ as the ultimate variable of integration,
and to do this we obtain the result

\[ A'(v) = -\frac{2br\sin\psi}{v}. \tag{49} \]

Putting

\[ r = 2b\sin\tfrac12\alpha, \tag{50} \]

we obtain, over the whole range of r,

\[ F(r) = \frac{\alpha}{\pi} + r^2(\pi - \alpha) - \frac{r\cos\tfrac12\alpha}{\pi b} - \tfrac12 r^2\sin\alpha, \tag{51} \]

while differentiation yields the frequency distribution as

\[ \phi(r) = 2r(\pi - \alpha) - 2r\sin\alpha. \tag{52} \]

* And by Robbins (1945).

It will be noted as a matter of interest that the chance of the two random points falling
further apart than the radius of the circle is $1 - F(b) = 3\sqrt3/4\pi$, or 9/22 nearly.*

We thus obtain the 2nd moment of the uncovered area from (42), (43) and (52).
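
The distance distribution for the circle of unit area can be checked by quadrature. The sketch below assumes the standard form of the density for two random points in a disc, written with $r = 2b\sin\frac12\alpha$ as $\phi(r) = 2r(\pi - \alpha - \sin\alpha)$; it confirms that the total probability is unity and evaluates the chance that the two points are further apart than the radius.

```python
import math

B = 1.0 / math.sqrt(math.pi)          # radius of the circle of unit area

def phi_circle(r):
    """Density of the distance between two random points in the unit-area circle."""
    if r <= 0.0 or r >= 2.0 * B:
        return 0.0
    alpha = 2.0 * math.asin(min(1.0, r / (2.0 * B)))     # from r = 2b sin(alpha/2)
    return 2.0 * r * (math.pi - alpha - math.sin(alpha))

def simpson(f, lo, hi, n=2000):
    """Composite Simpson's rule with n (even) sub-intervals."""
    h = (hi - lo) / n
    total = f(lo) + f(hi)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(lo + i * h)
    return total * h / 3.0

total_probability = simpson(phi_circle, 0.0, 2.0 * B)
further_than_radius = simpson(phi_circle, B, 2.0 * B)
print(total_probability, further_than_radius, 3.0 * math.sqrt(3.0) / (4.0 * math.pi))
```

The second figure comes out near 0.4135, in agreement with $3\sqrt3/4\pi$.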

9. Use of probability generating functions

(i) Binomial

It is interesting to apply first the binomial distribution of k, the number of C's dropped
uniformly and at random on T. Assume that T contains the centres of all the C's which
touch or cover A, and that S is some larger area including T. If l C's are dropped at random
on S, the probability generating function of the number of centres falling on T is

\[ G_1(u) = \left(\frac{S - T + Tu}{S}\right)^l. \tag{53} \]

Thus, from the general result of §4, the mth moment of the fraction of A uncovered is the
expectation of $\{(S - T + Tq)/S\}^l$, where q is the fraction of T uncovered by m C's falling on A.
But the expression within brackets is the fraction of S uncovered. The use of the binomial
generating function is thus verified.

(ii) Poisson

The Poisson distribution next suggests itself. If the number of centres follows this
distribution with a mean of $\lambda$ per unit area, the probability generating function is

\[ G_2(u) = e^{-\lambda T(1 - u)}, \tag{54} \]

and the mth moment will be the expectation over A of $e^{-\lambda T(1 - q)}$.
Alternatively, writing E for expectation, we could write this as

\[ \mu_m'(Y/A) = \mathrm{E}(e^{-\lambda Z}), \tag{55} \]

where Z = T(1 - q) is the area overlapped by m C's falling on A. In particular,

\[ \text{mean value of } Y/A = \mu_1'(Y/A) = e^{-\lambda C}. \tag{56} \]

Thus the mth moment of Y/A is related to the characteristic function of Z, but this result
does not appear to be of any theoretical importance: it does not, for instance, throw any light
on the frequency distribution of Y/A. Formula (55) does, however, demonstrate the fact,
which is otherwise obvious, that the area T does not enter into the frequency distribution of
the fraction of A not covered by C's whose fall follows the Poisson distribution.

As far as the calculation of the variance is concerned, we need to calculate first the 2nd
moment of Y/A. In the cases where the falling areas are circles, the area of overlap Z of two
circles is equal to $2C - \Omega(r)$, where $\Omega(r)$ is the function given above ((19) etc.) for the area
common to two C's with centres r apart. The 2nd moment is thus

\[ e^{-2\lambda C}\int e^{\lambda\Omega(r)}\,\phi(r)\,dr, \tag{57} \]

where $\phi(r)$ is the frequency function of r, the formulae for which are given above for the
various cases.
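
Formula (57) lends itself to direct quadrature. The following sketch treats circles falling on the unit square under the Poisson law; it is an illustration under stated assumptions (composite Simpson rule, range split at r = 1 and r = 2a), with the density of centres chosen so that the mean uncovered fraction takes a prescribed value.

```python
import math

def lens_area(r, a):
    """Overlap of two circles of radius a with centres r apart, (19)-(21)."""
    if r >= 2.0 * a:
        return 0.0
    theta = math.acos(r / (2.0 * a))
    return 2.0 * a * a * (theta - math.sin(theta) * math.cos(theta))

def phi_square(r):
    """Distance distribution in the unit square, (28)-(29)."""
    if r <= 0.0 or r >= math.sqrt(2.0):
        return 0.0
    if r <= 1.0:
        return 2.0 * r * (math.pi - 4.0 * r + r * r)
    root = math.sqrt(max(r * r - 1.0, 0.0))
    return 2.0 * r * (4.0 * math.asin(1.0 / r) + 4.0 * root - r * r - math.pi - 2.0)

def simpson(f, lo, hi, n=2000):
    h = (hi - lo) / n
    total = f(lo) + f(hi)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(lo + i * h)
    return total * h / 3.0

def poisson_variance(C, mean_fraction):
    """Variance of the uncovered fraction of the unit square, from (56) and (57)."""
    a = math.sqrt(C / math.pi)
    lam = -math.log(mean_fraction) / C            # from mean = exp(-lam * C)
    top = math.sqrt(2.0)
    cuts = sorted({0.0, min(2.0 * a, top), 1.0, top})
    integral = sum(simpson(lambda r: math.exp(lam * lens_area(r, a)) * phi_square(r), lo, hi)
                   for lo, hi in zip(cuts, cuts[1:]))
    second_moment = math.exp(-2.0 * lam * C) * integral
    return second_moment - mean_fraction ** 2

variance_half = poisson_variance(0.2, 0.5)
print(variance_half)
```

For circles of area 0.2 and a mean uncovered fraction of 0.5 this sketch gives a variance close to 0.030.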

* The solution to this problem (no. 698), given by Whitworth (1897), contains an error, resulting in the incorrect
value of 35/88 nearly.




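In the case of rectangles falling on rectangles, the necessary formula for the variance is
given by Neyman & Bronowski in the form of a series, viz.

\[ \sigma^2(Y/A) = \frac{4e^{-2\lambda\alpha\beta}}{a^2b^2}\sum_{s=1}^{\infty}\frac{\lambda^s(\alpha\beta)^{s+1}}{s!\,(s + 1)^2(s + 2)^2} \]
\[ \times\,\{(s + 2)a - \alpha + [\alpha - a](1 - a/\alpha)^{s+1}\}\{(s + 2)b - \beta + [\beta - b](1 - b/\beta)^{s+1}\}. \tag{58} \]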

Table 1. Variance of fraction of fixed area not covered by areas C
falling according to Poisson distribution

(i) Circles falling on square; (ii) circles falling on rectangle 2 x 1; (iii) circles falling on rectangle 4 x 1;
(iv) circles falling on circle; (v) squares falling on square (sides parallel).

Size of falling                Variance of area not covered
area C ÷ size                  Mean area not covered
of fixed area      Case        0.25           0.5            0.75

0.2                (i)         [illegible]    [illegible]    0.0264
                   (ii)        0.0182         [illegible]    0.0248
                   (iii)       [illegible]    0.0276         0.0229
                   (iv)        [illegible]    0.0310         [illegible]
                   (v)         [illegible]    [illegible]    0.0262

1.0                (i)         [illegible]    [illegible]    0.0789
                   (ii)        [illegible]    0.0904         0.0743
                   (iii)       [illegible]    [illegible]    [illegible]
                   (iv)        [illegible]    [illegible]    0.0810
                   (v)         [illegible]    0.0945         0.0781

1.8                (i)         [illegible]    0.1262         [illegible]
                   (ii)        0.0771         0.1193         0.1040
                   (iii)       [illegible]    0.1016         0.0824
                   (iv)        0.0829         [illegible]    0.1040
                   (v)         0.0790         0.1230         0.1002

In general, the variance increases in the following order as between the different combinations of shapes:

(1) Circles on rectangle 4 x 1, (iii).

(2) Circles on rectangle 2 x 1, (ii).

(3) Squares on square, (v).

(4) Circles on square, (i).

(5) Circles on circle, (iv).

There is an exception in the last case considered, however (mean fraction of area not covered 0.75, size of
falling area ÷ fixed area = 1.8), when the order is changed somewhat. However, the variances are generally of the
same approximate magnitude.

(iii) Contagious distribution

Neyman & Bronowski have included in their study the case of a contagious law of type A
with two parameters (see Neyman, 1939). Here the probability generating function is

    G_3(u) = [expression not legible in the source].

They have pointed out that the expression of this as a series enables calculations to be utilized
from the Poisson distribution.

In the general case the sth moment of the fraction of A not covered is the expectation of

    [expression not legible in the source],    (60)

where Z is the area common to s C's falling with their centres on A.
10. Numerical results

It is impossible to calculate complete tables covering all cases, but it is of interest to calculate
a few values for the purpose of comparison, and the following combinations of areas have been
considered:

         Fixed area         Falling areas

(i)      Square             Circles
(ii)     Rectangle 2 x 1    Circles
(iii)    Rectangle 4 x 1    Circles
(iv)     Circle             Circles
(v)      Square             Squares (with sides parallel to fixed square)

In each case the fixed area has been made of unit size, while the falling areas were respectively
C = 0.2, 1.0 and 1.8. The areas were assumed to fall according to the Poisson distribution,
the number of centres per unit area being such that the expected fraction of the fixed area
A not covered was respectively 0.25, 0.5 and 0.75. Since from equation (56) the expected
fraction of A not covered is $m = e^{-\lambda C}$, the relations between $\lambda$ and C for the 9 combinations
of m and C are $\lambda C = \log_e 4$, $\log_e 2$ and $\log_e 4/3$.

The 2nd moments were determined by quadrature from formula (57), where the falling areas
were circles, and by direct evaluation of the series (58) for the case of squares on squares.
The results are given in Table 1.

11. Experimental investigation

Before the work of Robbins and Neyman & Bronowski was brought to the notice of the
writer, an attempt was made to obtain experimentally a general formula for the variance of
the fraction of area not covered. Attention was confined to the case of circles falling on
squares, the centres of the former being chosen randomly (by means of random numbers)
within the area T whose boundary is at a distance a outside the sides of the unit square A.

For each combination of C and k, samples of up to 200 in size were drawn. The various
combinations were as given in Table 2.

Table 2. Ranges of k and C covered in experimental determination of variance

Three methods were used to measure the fraction of the fixed square not covered in each
sample. Method P involved the measurement of the covered area by planimeter. Method
L utilized a photoelectric cell to measure the amount of light passing through a glass plate
on which black paper disks had been stuck. Method C consisted of a simple count of squares
on graduated paper, and generally this was the most convenient to operate. (The two
neighbouring values of C were used to compare methods L and P.)

For each combination of C and k the average fraction not covered, $\bar Y$, was compared with
the theoretical value

\[ m = (1 - C/T)^k. \tag{61} \]
The observed standard deviation of the observations being s, the appropriate criterion for
testing the mean is

\[ t = \frac{\bar Y - m}{s/\sqrt P}, \]

P being the number of observations in the sample. The results are given in Table 3.
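
The criterion is a one-line calculation. The sketch below evaluates it for one set of sample figures as they read in the scan (observed mean 0.8924, expected mean 0.8961, standard deviation 0.0222, sample size 100); the function name is illustrative.

```python
import math

def t_criterion(observed_mean, expected_mean, std_dev, sample_size):
    """t = (Ybar - m) / (s / sqrt(P)) for testing an observed mean against theory."""
    return (observed_mean - expected_mean) / (std_dev / math.sqrt(sample_size))

t_value = t_criterion(0.8924, 0.8961, 0.0222, 100)
print(round(t_value, 4))
```

A value of about -1.67 lies inside the 5% significance limits of roughly ±1.96.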

Table 3. Comparison between observed and expected values of fraction of area not covered

P = planimeter method, L = photoelectric method, C = counting method.

Area of circle   No. of    No. in    Mean area not covered    Standard     Deviation               Method of
÷ area of        circles   sample    Observed    Expected     deviation    t = (Ȳ - m)/(s/√P)      measurement
square = C       k         P         Ȳ           m            s

[illegible]      5         100       0.9694      0.9683       0.0055        2.0000*                P
[illegible]      10        100       0.9394      0.9376       0.0088        2.0454*                P
[illegible]      15        100       0.9105      0.9079       0.0090        2.8889†                P
0.031            5         100       0.8924      0.8961       0.0222       -1.6667                 P
0.031            10        200       0.8007      0.8030       0.0334       -0.9739                 P
0.031            15        100       0.7134      0.7196       0.0404       -1.5347                 P
0.031            20        100       0.6349      0.6449       0.0423       -2.3641*                P
0.031            30        100       0.5057      0.5179       0.0493       -2.4746*                P
0.033            5         100       0.8853      [illegible]  0.0250       -6.9200†                L
0.033            20        100       0.6326      [illegible]  0.0500        0.9600                 L
0.033            40        100       0.3969      [illegible]  [illegible]   0.7629                 L
0.033            80        76        0.1617      [illegible]  0.0358        1.4766                 L and P
0.033            120       50        0.0657      0.0612       0.0255        1.2478                 P
0.25             1         30        0.8916      0.8950       0.0824       -0.2260                 C
0.25             2         110       0.7854      [illegible]  0.1214       -1.3477                 C
0.25             4         110       0.6264      0.6416       0.1481       -1.0764                 C
0.25             6         110       0.4891      0.5138       0.1307       -1.9820*                C
1.0              1         20        0.7583      [illegible]  0.2350       -0.1332                 C
1.0              2         50        0.5890      [illegible]  0.2340        0.1025                 C
1.0              4         50        0.3861      0.3430       0.2334        1.3057                 C
1.0              6         50        0.1686      0.2008       0.1630       -1.3968                 C

* Between 5 and 1% levels.    † Beyond 1% level.


An examination of the values of t shows that too many of them are outside the 5%
significance levels, while in each set corresponding to one value of C the values are too
frequently of the same sign. The worst deviation is for C = 0.033 and k = 5, with t = -6.92,
but this only corresponds to a difference between the observed mean of 0.885 and the
theoretical mean of 0.890. The test is thus very sensitive and the deviations are not serious,
and they arise from imperfections in the technique which have not been investigated in detail.

As regards random errors of measurement as distinct from bias, it was not possible to
carry out a systematic estimation of the contribution of this source to the total variation.
A series of repeated measurements for the case C = 0.031, k = 30, for which the observed
mean was 0.51, showed that the individual measurements had a standard error of about
0.009. As the total standard deviation in this case was 0.049, the true estimate of the standard
error (i.e. omitting the error of measurement) was $\sqrt{0.049^2 - 0.009^2} = 0.048$, and for our
purposes this difference is negligible. There is thus some evidence for assuming that this
method of estimating the variance was satisfactory.

12. Deeivation oe empirical formula for the variance 
The consideration that the theoretical variance $\sigma^2$ of the fraction uncovered must be 
small whenever the mean m of this fraction is near the limits of its range, zero or unity, 
suggests that we might try the relation 

$$\sigma^2 \sim m(1-m)$$

for given C. Accordingly we have calculated the quantity 

$$g = \frac{s^2}{m(1-m)}, \qquad (62)$$

and the results are given in Table 4. 

Table 4. Derivation of empirical formula 

[Table 4 was printed sideways and its body is not legible in this copy. Column headings: 
C; no. of circles k; ratio T/C; observed variance $s^2$; g; $(T/C)^{3/2} g$; empirical value of 
variance $2.3\,m(1-m)/(T/C)^{3/2}$; exact value of variance $\sigma^2$; percentage error in $s^2$; 
percentage error in $\sigma_e^2$. The one legible column, the empirical value of the variance, reads 
in printed order: 0.0000363, 0.0000692, 0.0000990, 0.000684, 0.00116, 0.00148, 0.00168, 
0.00184, 0.000785, 0.00188, 0.00192, 0.00105, 0.000461, 0.00736, 0.01249, 0.0180, 0.0196, 
0.0471, 0.0636, 0.0590, 0.0420.] 

For each value of C the values of g are by no means constant, but the variation is not 
excessive, and it is considered that for practical purposes we can take g to be a function 
of C, at least over the range considered. To obtain a suitable form for this function, it was 
decided, for very general reasons, to seek a simple relation between g(C) and T/C, the latter 
being roughly the number of C's which could be placed on T if they could be fitted together 
without overlapping. 


Table 5. Comparison of empirical formula for variance with exact value 

$\sigma^2$ = exact value, $\sigma_e^2$ = empirical value, E = percentage error = $100(\sigma_e^2 - \sigma^2)/\sigma^2$ 

$C = \pi a^2$            k = 1     k = 2     k = 3     k = 4     k = 5     k = 6 

0.2   $\sigma^2$       0.00501   0.00873   0.0114    0.0132    0.0144    0.0150 
      $\sigma_e^2$     0.00516   0.00896   0.0117    0.0135    0.0147    0.0154 
      E                +3.0      +2.6      +2.0      +2.3      +2.1      +2.7 

0.4   $\sigma^2$       0.0156    0.0246    0.0290    0.0305    0.0300    0.0274 
      $\sigma_e^2$     0.0149    0.0237    0.0284    0.0304    0.0305    0.0294 
      E                -4.5      -3.7      -2.1      -0.3      +1.7      +7.3 

0.6   $\sigma^2$       0.0277    0.0403    0.0447    0.0427    0.0395    0.0339 
      $\sigma_e^2$     0.0257    0.0384    0.0431    0.0433    0.0407    0.0370 
      E                -7.2      -4.7      -3.6      +1.4      +3.0      +9.1 

0.8   $\sigma^2$       0.0398    0.0541    0.0552    0.0501    0.0428    0.0351 
      $\sigma_e^2$     0.0365    0.0517    0.0551    0.0525    0.0471    0.0407 
      E                -8.3      -4.4      -0.2      +4.8      +10.0     +16.0 

1.0   $\sigma^2$       0.0508    0.0651    0.0628    0.0540    0.0436    0.0339 
      $\sigma_e^2$     0.0471    0.0636    0.0648    0.0590    0.0507    0.0420 
      E                -7.3      -2.3      +3.2      +9.3      +16.3     +23.9 

1.2   $\sigma^2$       0.0605    0.0737    0.0677    0.0554    0.0427    0.0317 
      $\sigma_e^2$     0.0571    0.0740    0.0724    0.0636    0.0525    0.0419 
      E                -5.6      +0.4      +6.9      +14.6     +22.9     +32.6 

1.4   $\sigma^2$       0.0689    0.0804    0.0706    0.0554    0.0410    0.0292 
      $\sigma_e^2$     0.0667    0.0832    0.0786    0.0665    0.0531    0.0411 
      E                -3.2      +3.5      +11.3     +20.0     +29.5     +40.7 

1.6   $\sigma^2$       0.0763    0.0855    0.0723    0.0546    0.0389    0.0267 
      $\sigma_e^2$     0.0757    0.0913    0.0834    0.0684    0.0530    0.0398 
      E                -0.8      +6.8      +15.4     +25.3     +36.3     +49.1 

1.8   $\sigma^2$       0.0828    0.0895    0.0730    0.0531    0.0367    0.0244 
      $\sigma_e^2$     0.0843    0.0985    0.0873    0.0695    0.0524    0.0383 
      E                +1.8      +10.1     +19.6     +30.9     +42.8     +57.0 

Fig. 2 shows the result of plotting the observed mean value of g(C) against T/C on 
logarithmic scales. The points (the values of g for various values of k are plotted in 
addition to the mean) lie reasonably close to a straight line of slope −1.5, indicating the 
relation 

$$g(C) \sim (C/T)^{3/2}. \qquad (63)$$

The values of $(T/C)^{3/2} g(C)$ are shown in Table 4, where the values are seen to lie between 2 
and 2.5 with an average of 2.3. Thus we derive the rough empirical formula 

$$\sigma_e^2 = \frac{2.3\,m(1-m)}{(T/C)^{3/2}}, \quad \text{where} \quad m = (1 - C/T)^k. \qquad (64)$$

These are given in Table 4, together with the values in some cases of the percentage error of 
$\sigma_e^2$ compared with the true value $\sigma^2$ obtained from the methods described in §5; the 
percentage error in the estimate $s^2$ observed experimentally is also given. (It was not possible to 
evaluate $\sigma^2$ in all cases, as the computation is somewhat laborious.) Another set of 
comparisons between the empirical and the exact formula is given in Table 5, over the 
range $C = \pi a^2$ from 0.2 to 1.8 and k = 1, 2, 3, 4, 5, 6. 

It will be seen that the empirical formula gives quite a satisfactory fit, e.g. with an error 
less than 10 %, over a considerable part of the range studied, but that the error tends to 
increase, i.e. the formula exaggerates the variance, as C and k increase. 
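
The empirical formula (64) is simple to evaluate numerically. The sketch below assumes a fixed square of unit area and takes T to be that square enlarged by the circle radius a on every side with rounded corners, T = 1 + 4a + πa². This expression for T is an assumption drawn from the description of T as the area containing the centres of all circles which cover or touch the square, and the function names are illustrative only.

```python
import math


def centre_area(c: float) -> float:
    """Area T in which circle centres may fall, for a unit fixed square.

    Assumption: T = 1 + 4a + pi*a^2, where the circle area is C = pi*a^2.
    """
    a = math.sqrt(c / math.pi)
    return 1.0 + 4.0 * a + c


def mean_uncovered(c: float, k: int) -> float:
    """Mean fraction of the square left uncovered by k circles: (1 - C/T)^k."""
    return (1.0 - c / centre_area(c)) ** k


def empirical_variance(c: float, k: int, const: float = 2.3) -> float:
    """Empirical variance of formula (64): const * m(1 - m) / (T/C)^(3/2)."""
    m = mean_uncovered(c, k)
    return const * m * (1.0 - m) / (centre_area(c) / c) ** 1.5


for k in (1, 2, 4, 6):
    print(k, round(empirical_variance(1.0, k), 4))
```

Under the stated assumption for T, the values for C = 1.0 can be compared directly with the empirical row for C = 1.0 in Table 5.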


13. Use of the empirical formula for the Poisson case 


If k circles are dropped with their centres falling at random on the area T the mean area not 
covered can be written as 

$$m(k) = (1 - C/T)^k,$$

and the empirical formula for the variance as 

$$\sigma_e^2(k) = \frac{2.3\,m(k)[1 - m(k)]}{(T/C)^{3/2}}. \qquad (65)$$


Table 6. Comparison of empirical formula for variance of area not covered 
in case of circles falling on square according to Poisson distribution 

Size of falling area C                Mean area not covered, m 
÷ fixed area                        0.25      0.5       0.75 

0.2      $\sigma^2$                 0.0186    0.0303    0.0254 
         $\sigma_e^2$               0.0196    0.0308    0.0257 
         % error                    5.4       1.7       1.2 

1.0      $\sigma^2$                 0.0608    0.0964    0.0769 
         $\sigma_e^2$               0.0669    0.0981    0.0781 
         % error                    10.0      1.8       1.6 

1.8      $\sigma^2$                 0.0816    0.1262    0.1026 
         $\sigma_e^2$               0.0942    0.1348    0.1057 
         % error                    15.4      6.8       3.0 

Hence if k follows the Poisson distribution with expectation $\lambda T$, the total variance based 
on the empirical formula is 

$$\sigma_e^2 = \sum_{k=0}^{\infty} \frac{e^{-\lambda T}(\lambda T)^k}{k!}\left\{\sigma_e^2(k) + [m(k) - m]^2\right\},$$

where m = expected area not covered 

$$= e^{-\lambda C}. \qquad (66)$$

We find after expanding $m^2(k)$ that 

$$\sigma_e^2 = \frac{2.3}{(T/C)^{3/2}}\left[m - m^2 e^{\lambda C^2/T}\right] + m^2\left[e^{\lambda C^2/T} - 1\right].$$

This formula is compared with the exact values in Table 6 over the same range as in Table 1. 
The agreement is again reasonably satisfactory over the greater part of the range, large 
positive errors occurring for large values of C and small values of m. These errors might be 
reduced by using a constant rather smaller than 2.3 in the empirical formula, but the point 
has not been investigated further. 
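
The Poisson case can be checked numerically. The sketch below assumes that the total variance is the Poisson-weighted sum of the fixed-k empirical variance plus the squared deviation of m(k) from the overall mean m, and that T = 1 + 4a + πa² for a unit square; both are assumptions made for illustration. The closed form in the second function is the result of carrying out that sum algebraically and is offered as a check, not as the paper's own printed expression.

```python
import math


def centre_area(c: float) -> float:
    """Assumed area T for circle centres around a unit square: 1 + 4a + pi*a^2."""
    a = math.sqrt(c / math.pi)
    return 1.0 + 4.0 * a + c


def poisson_variance_by_sum(c: float, m: float, const: float = 2.3, terms: int = 300) -> float:
    """Sum Poisson weights times [empirical variance for k + (m(k) - m)^2]."""
    t = centre_area(c)
    mean_count = -math.log(m) * t / c          # lambda*T, from m = exp(-lambda*C)
    scale = const / (t / c) ** 1.5
    weight = math.exp(-mean_count)             # Poisson probability of k = 0
    total = 0.0
    for k in range(terms):
        m_k = (1.0 - c / t) ** k
        total += weight * (scale * m_k * (1.0 - m_k) + (m_k - m) ** 2)
        weight *= mean_count / (k + 1)         # step to the probability of k + 1
    return total


def poisson_variance_closed_form(c: float, m: float, const: float = 2.3) -> float:
    """Closed form of the same sum, using E[m(k)^2] = m^2 * exp(lambda*C^2/T)."""
    t = centre_area(c)
    growth = math.exp(-math.log(m) * c / t)    # exp(lambda*C^2/T)
    scale = const / (t / c) ** 1.5
    return scale * (m - m * m * growth) + m * m * (growth - 1.0)


for c in (0.2, 1.0, 1.8):
    print(c, [round(poisson_variance_closed_form(c, m), 4) for m in (0.25, 0.5, 0.75)])
```

The two functions agree to rounding error, and the printed values can be set beside the empirical rows of Table 6.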

Summary 

The mathematical study of bombing has given rise to the following problem. A fixed 
outline, such as a square or circle, is drawn on a plane, and other similar outlines are dropped 
at random on it. Estimates are then required of the variance of the fixed area which is not 
covered. Work by Robbins enables a theoretical formula to be derived, and Bronowski 
& Neyman have treated, by an independent method, the special case of rectangles falling 
on rectangles. 

It is shown that in the case of circles falling on circles, squares or rectangles, the variance 
can be expressed as the integral with respect to r of the product of two functions, one being 
a simple function of the area of overlap of two circles with centres r apart, and the other 
being the frequency function of the distance r between two points chosen at random in the 
'covered' area. This applies both to the case where the number of falling areas is fixed and 
where it follows a Poisson distribution. Numerical values have been calculated for a 
number of cases. An experimental method had been carried out prior to the above 
theoretical work, and the following empirical formula was derived for the variance of the 
fraction of a fixed square not covered by k circles, area C, falling at random on an area T 
containing centres of all C's which cover or touch the fixed square: 

$$\sigma_e^2 = \frac{2.3\,m(1-m)}{(T/C)^{3/2}},$$

where m = mean fraction of area not covered $= (1 - C/T)^k$. 

This formula, and its extension to the Poisson case, have been shown to be in reasonable 
agreement with the exact values over a considerable range. 

The writer is indebted to Miss G. O. Jeffcoate for valuable assistance in the computing 
and experimental work. 

A STUDY OF A FIRST DYNASTY SERIES OF EGYPTIAN SKULLS 
FROM SAKKARA AND OF AN ELEVENTH DYNASTY SERIES 
FROM THEBES 

By A. BATRAWI, Ph.D. and G. M. MORANT, D.Sc. 

1. Introduction. This paper deals with forty-four male crania of 1st dynasty date 
(c. 3400 B.C.) discovered at Sakkara by Macramallah Effendi, who has published a report 
on the excavations (1940), and with fifty-five crania of 11th dynasty (c. 2000 B.C.) soldiers 
unearthed at Thebes in 1927 by the Egyptian Expedition of the Metropolitan Museum of 
Art, New York (Winlock, 1928). The cemetery at Sakkara, 20 miles south of Gizeh, was used 
by the middle classes of the local community. Prof. D. E. Derry has kindly provided the 
following notes on it: 

The 1st dynasty cemetery at Sakkara excavated by Macramallah Effendi is of special interest. 
Comparatively few cemeteries of this date have been found, and, while the total number of forty-four 
skulls from which reliable measurements could be taken was small, yet the results yielded by these 
are such as to show that we are dealing with a race which differs in important features from those 
exhibited by the so-called predynastic people. 

The observation that there were two races in Egypt in the early dynastic period was first made in 
the year 1909, when the results of measurements obtained from a series of male and female skulls of 
the 4th and 5th dynasties from the great necropolis surrounding the pyramids of Giza came to be 
examined and compared with crania from early predynastic graves. Until then the theory of an 
unbroken evolution of the Egyptian race from prehistoric times right through the dynastic period had 
been taught. It now became obvious that the culture which we know of as peculiarly Egyptian was 
associated with a race which could not have been derived from the predynastic people. The introduction 
of stone-working resulting in the erection of great tombs and statuary, as well as beautifully executed 
reliefs, paintings and above all writing, all pointed to a race far in advance of the predynastic people, 
who although skilled in the making of bowls and vases in stone as well as in pottery, and who had 
already attained to the discovery of the uses of copper, were, nevertheless, little removed from the 
Neolithic period. 

The cemetery is unusual in consisting entirely of males. In the note on the skulls published in 
Macramallah Effendi's report it is stated that there were some females included in the collection. After 
the report had gone to press Macramallah Effendi informed the writer that a part of the cemetery 
was of 18th dynasty date. It turned out that all the female skulls came from this part and that therefore 
the 1st dynasty cemetery contained only remains of males. Dr Batrawi's examination of the figures 
confirms the statement made at the beginning of this note and shows the closeness of the relationship 
of the people of the 1st dynasty at Sakkara with those of the 4th and 5th dynasties from Deshasheh 
and Medum. 

In his report on the discovery of the series of 11th dynasty skeletons at Thebes Mr H. E. 
Winlock (1928) says that they were found in 'a tomb in the row where the grandees of 
Mentuhotep's court had been buried'. He remarks: 

Obviously what we had found was a soldiers’ tomb. To judge from the cheapness of their burial they 
were only soldiers of the rank and file, and yet they had been given a catacomb presumably prepared 
for the dependents of the royal household, next to the tomb of the chancellor Khety. Clearly that was 
an especial honour. If we are right in supposing that all had been buried at once, they must have been 
slain in a single battle. 

Prof. Derry examined the bodies on the spot, and he took measurements of the crania 
and of some of the long bones. About sixty bodies were counted and all proved to be those 
of adult males who had died in the prime of life. Prof. Derry says that the skeletons were 
reburied after they had been examined. 

2. The measurements of the crania. The Sakkara series was sexed and measured by 
Prof. Derry and we are indebted to him for allowing us to use his records in this paper. 
All the absolute measurements, given in Appendix II, are his readings with the exception 
of those of the foramen magnum, which were taken by one of the writers (A.B.) of this 
report. The measurements of the Thebes series were also kindly provided by Prof. Derry, 
together with means he had calculated. The readings for individual crania are given in 
Appendix III. 

The technique of measurement followed by Prof. Derry is that of the Monaco Congress 
(Duckworth, 1913). He had used this when measuring the predynastic Egyptian series of 
skulls from Badari, of which part was remeasured later in London by Miss B. N. Stoessiger 
(1927), who followed the biometric technique. The two sets of measurements of the same 
fifty-three specimens have been compared (Morant, 1935), thus showing in detail what 
relations are to be expected between readings obtained by following the two techniques. 
These results were taken into account in preparing the definitions of Prof. Derry's 
measurements given in Appendix I below. The characters are denoted as far as possible there and 
in the tables by the customary index letters of the biometric technique. 

3. The nature of the two series. Mean measurements and standard deviations for the two 
series are given in Table 1. The longest series of Egyptian skulls measured, known as the 
E series, came from a cemetery at Giza used from the 26th-30th dynasties (Davin & 
Pearson, 1924). Judging from comparisons of constants for a number of cranial characters, 
most of the other ancient Egyptian series described exhibit almost precisely the same order 
of variation as the one from Giza. In general they have been found to be rather less variable 
than European cranial series, while there is no evidence that there was any appreciable 
change in the variation exhibited by Egyptian populations during the long period from 
early predynastic to Roman times. 

The two new series are shorter than several from Egypt previously described. Counting 
the number of characters for which the standard deviation for one series is greater or less 
than the corresponding constant for the other series, the situation is: 

Sakkara and Thebes: Sakkara S.D. greater for nine and less for ten characters; 

Sakkara and Giza: Sakkara S.D. greater for four and less for eleven characters; 

Thebes and Giza: Thebes S.D. greater for eight and less for nine characters. 

This crude comparison suggests that there can have been no marked differences between 
the variabilities of the three populations represented. As sets of differences are considered, 
the limit of significance accepted may be taken considerably higher than in the case of 
a single difference. Suppose that there is a real distinction if two of the standard deviations 
differ by an amount which is 3.5 or more times its probable error. Then one significant 
difference is found for the Sakkara and Thebes series (NH, L, Sakkara S.D. greater, 
Δ/P.E. Δ = 3.8), none for the Sakkara and Giza, and three for the Thebes and Giza series 
(H' 4.1, 3.5, S2 3.5, Giza S.D. the greater in all three cases). The two new series are too 
short to give reliable comparisons, but the evidence suggests that the populations they 
represent were equally homogeneous, while both were rather less mixed in racial 
composition than the 26th-30th dynasty population of Giza. 

4. Comparisons of mean measurements. Following biometric practice, it may be supposed 
in such a case that no statistical analysis of the series can reveal its racial components. 
The relationships of the series have to be judged by comparing them as wholes, on the basis 
of mean measurements, with other series known to exhibit unexceptional variation. It 
was shown by Morant (1925) that the recorded series of ancient Egyptian skulls can be 
divided into two groups. These were called, for convenience, the Upper and Lower Egyptian, 


Table 1. Means and standard deviations (with probable errors) of the Sakkara 1st dynasty 
and Thebes 11th dynasty series of male skulls 

                                   Means                              Standard deviations 
Character*          Sakkara               Thebes                Sakkara         Thebes 

L                   186.9 ± 0.56 (41)     181.8 ± 0.53 (54)     5.31 ± 0.40     5.75 ± 0.37 
B                   138.7 ± 0.41 (43)     138.3 ± 0.41 (54)     3.99 ± 0.29     4.52 ± 0.29 
B'                  96.5 ± 0.39 (36)      93.6 ± 0.43 (55)      3.48 ± 0.28     4.72 ± 0.30 
H'                  135.4 ± 0.67 (32)     137.1 ± 0.37 (51)     5.63 ± 0.47     3.91 ± 0.26 
[Aur. ht.]          114.8 ± 0.67 (27)     -                     5.20 ± 0.48     - 
LB                  102.7 ± 0.57 (29)     100.7 ± 0.34 (46)     4.57 ± 0.40     3.40 ± 0.24 
U                   518.8 ± 1.5 (29)      507.4 ± 1.3 (50)      12.0 ± 1.1      13.2 ± 0.89 
S1                  -                     125.7 ± 0.47 (52)     -               5.01 ± 0.33 
S2                  -                     129.4 ± 0.56 (53)     -               6.00 ± 0.39 
S3                  -                     115.2 ± 0.82 (51)     -               8.63 ± 0.58 
S                   -                     370.5 ± 1.2 (50)      -               12.7 ± 0.86 
[Broca's Q']        -                     300.3 ± 0.84 (49)     -               8.72 ± 0.59 
fml                 36.7 ± 0.26 (31)      -                     2.18 ± 0.19     - 
fmb                 30.4 ± 0.22 (29)      -                     1.74 ± 0.15     - 
[G'H]               71.9 ± 0.55 (30)      72.0 ± 0.37 (45)      4.43 ± 0.39     3.71 ± 0.26 
GB                  96.5 ± 0.62 (25)      96.5 ± 0.48 (38)      4.57 ± 0.44     4.40 ± 0.34 
J                   127.8 (14)            127.6 ± 0.52 (32)     -               4.33 ± 0.36 
[NH, L]             51.2 ± 0.50 (29)      51.8 ± 0.25 (45)      4.00 ± 0.35     2.52 ± 0.18 
NB                  25.4 ± 0.21 (30)      25.0 ± 0.20 (42)      1.70 ± 0.15     1.92 ± 0.14 
[O1]                38.9 ± 0.24 (26)      39.1 ± 0.16 (44)      1.84 ± 0.17     1.55 ± 0.11 
[O2]                32.5 ± 0.20 (26)      33.1 ± 0.23 (44)      1.50 ± 0.14     2.23 ± 0.16 
[Prosthion GL]      99.6 ± 0.56 (26)      96.5 ± 0.47 (43)      4.23 ± 0.40     4.61 ± 0.34 
100 B/L             74.2 ± 0.26 (39)      76.1 ± 0.26 (54)      2.44 ± 0.19     2.84 ± 0.18 
100 H'/L            72.8 ± 0.41 (30)      75.6 ± 0.29 (51)      3.37 ± 0.29     3.06 ± 0.20 
100 B/H'            102.6 ± 0.60 (31)     100.8 ± 0.41 (51)     4.95 ± 0.42     4.34 ± 0.29 
100 fmb/fml         83.3 ± 0.72 (29)      -                     5.76 ± 0.51     - 
[100 G'H/GB]        74.3 ± 0.50 (25)      76.8 ± 0.53 (38)      3.70 ± 0.35     4.83 ± 0.37 
[100 NB/NH, L]      49.5 ± 0.58 (29)      48.3 ± 0.48 (42)      4.63 ± 0.41     4.61 ± 0.34 
[100 O2/O1]         83.0 ± 0.51 (26)      84.0 ± 0.49 (44)      3.86 ± 0.36     4.81 ± 0.35 

* The characters are defined in Appendix I. A symbol in square brackets denotes either that the measurement 
is one not usually included in the biometric technique, or else that Prof. Derry's method of taking the 
measurements does not accord with biometric practice. 


though there is evidence that the regions represented changed somewhat with time. The 
series in the first group came from the neighbourhood of Thebes and sites farther south, 
while those in the second group came from the same region of Upper Egypt and sites 
farther north. The first group includes all the predynastic series that have been described 
and some of dynastic date, the latest being of the 18th dynasty: the second group ranges 
from the 1st dynasty to Roman times, though no series available earlier than the 4th 
dynasty had come from the region immediately south of the Delta. The Sakkara series 
described in the present paper extends the range of such material back to the 1st dynasty. 
It had been found that the means for all these series are almost constant for most of the 
metrical characters commonly recorded, but for a few measurements more significant 
differences are found and these separate the two groups of series. Characters of both kinds 
are treated in Table 2, which is based on Table XIII in Risdon's paper (1939) on the human 
remains from Lachish (Palestine). The first six characters are those which make the clearest 
distinction between the Upper and Lower Egyptian types of series, and they are all breadths 
of the cranium or dependent on breadths, the latter being the horizontal circumference and 
the two indices. The Sakkara series is clearly assigned to the Lower Egyptian 
group, and if counted as a member of this the range of the mean minimum frontal breadths 
(B') for the group is slightly extended. The Thebes series is also assigned to the Lower 
Egyptian group by four of the six characters in question; for U and 100 B/H', however, 
its means fall within the ranges given for the Upper Egyptian type of series. 


Table 2. Ranges of mean measurements for two groups of series of ancient Egyptian 
male crania and means for the Sakkara and Thebes series* 

Series                 Period                     B                   J                   B'                U 

Upper Egyptian type    Early predyn.-18th dyn.    131.4-134.3 (10)    12?.8-127.5 (8)     90.4-92.8 (4)     500.0-510.4 (4) 
Sakkara                1st dyn.                   138.7               127.8               96.5              518.8 
Thebes                 11th dyn.                  138.3               127.6               93.6              507.4 
Lower Egyptian type    1st dyn.-Roman             135.3-139.3 (9)     127.6-131.3 (8)     93.0-96.2 (5)     510.8-518.7 (5) 

Series                 Period                     100 B/L             100 B/H'            L                    H' 

Upper Egyptian type    Early predyn.-18th dyn.    71.7-73.7 (10)      98.1-101.1 (10)     182.2-185.2 (10)     132.4-135.9 (10) 
Sakkara                1st dyn.                   74.2                102.6               186.9                135.4 
Thebes                 11th dyn.                  76.1                100.8               181.8                137.1 
Lower Egyptian type    1st dyn.-Roman             73.7-76.0 (9)       102.3-106.4 (9)     181.4-185.8 (9)      130.7-136.0 (9) 

* The characters are defined in Appendix I. The numbers in brackets give the numbers of series to which the ranges 
relate. In the case of these previously described series the smallest number of crania on which any one of the means is 
based is 16, though this minimum number is about 30 for most of the characters. The numbers on which the Sakkara 
and Thebes means are based can be seen from Table 1, the only one less than 20 being 14 for the bizygomatic breadth 
(J) of the Sakkara series. 


The last two characters in Table 2, which are the length and height of the cranium, fail 
to distinguish the two contrasted groups of series. The means for the two new series fall 
outside the ranges previously given by all the ancient Egyptian material, the Sakkara 
series giving the greatest L and the Thebes the greatest H'. The evidence of other characters 
must be taken into account, but so far the comparisons suggest that the two new series 
are of the Lower Egyptian type, and it is to be expected that they bear a closer resemblance 
to some of the series assigned to that group than to any other cranial series. 

At the same time it may be noted that the Sakkara 1st dynasty and Thebes 11th dynasty 
populations are clearly differentiated by their mean cranial measurements. There are twenty 
characters in Table 1 for which means for both series are available. The most significant 
difference is for L, and it is 6.6 times its probable error, while five other characters (B', U, 
prosthion GL, 100 B/L and 100 H'/L) also show differences which exceed four times their 
probable errors. 
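
The comparison of two means in units of probable error can be reproduced with a few lines of arithmetic. The sketch below assumes the usual conventions of the period: the probable error of a mean is 0.6745 times the standard deviation divided by the square root of the number of crania, and the probable error of a difference is the root sum of squares of the two probable errors. The function names are illustrative, and the entry for L is read here as 186.9 ± 0.56 for Sakkara (S.D. 5.31, 41 crania) and 181.8 ± 0.53 for Thebes.

```python
import math


def probable_error_of_mean(sd: float, n: int) -> float:
    """Probable error of a mean: 0.6745 * S.D. / sqrt(n)."""
    return 0.6745 * sd / math.sqrt(n)


def difference_in_probable_errors(mean_a: float, pe_a: float,
                                  mean_b: float, pe_b: float) -> float:
    """Absolute difference of two means divided by the probable error of
    the difference, taken as sqrt(pe_a^2 + pe_b^2)."""
    return abs(mean_a - mean_b) / math.hypot(pe_a, pe_b)


# Cranial length L: Sakkara against Thebes.
ratio = difference_in_probable_errors(186.9, 0.56, 181.8, 0.53)
print(round(ratio, 1))
print(round(probable_error_of_mean(5.31, 41), 2))
```

A ratio above 4 is the criterion applied in the text to the other characters.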

5. Comparisons by coefficients of racial likeness. The method of Karl Pearson's coefficient 
of racial likeness has been applied extensively to series of ancient Egyptian crania. Risdon 
(1939) has given comparisons made in that way for twenty-two male series, including three 
from sites outside Egypt, and the treatment below is almost restricted to comparisons 
between these and the two new series described in the present paper. The procedure 
followed in applying the method described in several papers in Biometrika was adopted without 
modification.* 

In deriving a classification of a number of cranial, or living, series from the coefficients 
of racial likeness found between them, it has been shown repeatedly that the most 
suggestive arrangement is obtained if the closest resemblances of the series, indicated by 
coefficients below a certain value, are alone taken into account. Risdon has given a diagram 
(1939, Fig. 3) showing all the reduced coefficients less than 5.0 between the twenty-two 
series with which he dealt. There are fifty-three of this lowest order among the 231 
( = 22 × 21/2) comparisons. The addition of the two new series to the classification referred 
to only requires a knowledge of the reduced coefficients less than 5.0 between them and 
the twenty-two series. 

It has been pointed out that inspection of a few mean measurements can indicate whether 
a comparison of two particular series would almost certainly give a reduced coefficient 
greater than the limit chosen, or whether it might provide a value less than 5.0. The 
measurements used for this rapid test are six which are known to be those which show the 
most significant differences, and the greatest proportions of such differences, in comparisons 
of the group of series. These are the length, breadth and height of the brain-box and the 
three indices derived from these chords. For the fifty-three comparisons of the twenty-two 

* A 'crude' coefficient is defined by 

$$\frac{1}{m}\sum\left\{\frac{n_s n_s'}{n_s + n_s'}\,\frac{(M_s - M_s')^2}{\sigma_s^2}\right\} - 1 \pm 0.6745\sqrt{\frac{2}{m}},$$

where $M_s$ is a mean, based on $n_s$ crania for the first series, $M_s'$ and $n_s'$ are the corresponding constants for the 
second series and m characters are compared. The $\sigma$'s of the long 26th-30th dynasty Egyptian series were 
used throughout. The crude coefficient may be written 

$$\frac{1}{m}S(\alpha) - 1 \pm 0.6745\sqrt{\frac{2}{m}}, \quad \text{where} \quad \alpha = \frac{n_s n_s'}{n_s + n_s'}\,\frac{(M_s - M_s')^2}{\sigma_s^2}.$$

Its value is largely determined by the sizes of the two samples that happen to be available, if in fact they do not 
represent the same population. As many excavated crania are damaged to some extent, in the case of a particular 
series means for different characters will usually be based on various numbers of specimens (see Table 1). The 
mean number available for the characters used is denoted by $\bar{n}_s$ in the case of the first series and by $\bar{n}_s'$ in the 
case of the second series, and these 'sizes' of the samples are usually unequal and may be of very different orders. 
To obtain, as far as possible, a measure of the absolute divergence of the types compared which does not depend 
on the numbers of crania available, a 'reduced' coefficient of racial likeness is computed. This is defined to be 

$$\frac{100 \times 100}{100 + 100} \cdot \frac{\bar{n}_s + \bar{n}_s'}{\bar{n}_s\,\bar{n}_s'}\left\{\frac{1}{m}S(\alpha) - 1 \pm 0.6745\sqrt{\frac{2}{m}}\right\}.$$

A reduced coefficient may be supposed a good approximation to the value which would be obtained if all the 
means for both series were for 100 individuals instead of for the numbers actually available. If a crude 
coefficient differs from zero by less than 3.5 times its probable error (a rare occurrence) then it is supposed 
that there is no evidence of a significant distinction between the two populations represented. In this case 
there is no need to compute a reduced coefficient. Otherwise, reduced coefficients are found and the classification 
of a number of series is based on these. 

series giving reduced coefficients less than 5.0, the maximum differences for the six 
characters (in mm. or units of the indices) are: 

L       B       H'      100 B/L     100 H'/L    100 B/H' 
3.1     3.0     3.5     2.0         2.3         2.8 

To avoid the danger of missing comparisons which might be of the order required, in 
applying the test each of these values was increased arbitrarily by 0.2 giving: 

L       B       H'      100 B/L     100 H'/L    100 B/H' 
3.3     3.2     3.7     2.2         2.5         3.0 

In comparing a new series with the twenty-two it may be supposed that a reduced 
coefficient of racial likeness greater than 5.0 would almost certainly be found if the 
difference between the means is greater than the accepted limit in the case of any one or more 
of the six characters. For such comparisons the coefficients were not calculated. If the 
differences between the means are less than the limits for all six characters then a reduced 
coefficient less than 5.0 might be found: the coefficients were calculated in all such cases. 
In this way detailed comparisons were judged to be required between the new Sakkara, 
1st dynasty, series, on the one hand, and six of the twenty-two treated by Risdon on the 
other; and between the new Thebes, 11th dynasty, on the one hand, and only two of the 
twenty-two series on the other. The previously described series involved in these two sets 
of comparisons, one series being included in both sets, are: 

(i) Deshasheh and Medum, 4th and 5th dynasties (Thomson & Maciver, 1905). The two 
towns are south of Sakkara and both less than 40 miles from it. 

(ii) Gizeh, 26th-30th dynasties (Davin & Pearson, 1924). 

(iii) Sedment, 9th dynasty (Woo, 1930). 

These three and the new Sakkara series are all from Lower Egypt among the total 
twenty-four series referred to above. All the other Egyptian sites mentioned are in Middle 
Egypt and close to Abydos and Thebes. 

(iv) Abydos, 18th dynasty (Thomson & Maciver, 1905). 

(v) Abydos, 1st dynasty, royal tombs (Morant, 1925). 

(vi) Lachish, Palestine (Risdon, 1939). This series represents an Egyptian population. 
It is assigned to the seventh and eighth centuries B.C., though it is not well dated. 

(vii) Tigre district, Abyssinia, modern (Sergi, 1912, means given in Morant, 1925). 

(viii) Cretans, modern (von Luschan, 1913, means given in Woo, 1930). This series is 
not one of the twenty-two dealt with by Risdon. It was included because of its close 
resemblance to the new 11th dynasty series from Thebes. The test based on a comparison 
of the means of the six calvarial measurements shows that the only ancient Egyptian series 
which might give reduced coefficients less than 5.0 with the Cretan series are the Theban 
11th dynasty and the Sedment series ((iii) above). 

It must be emphasized that a reduced coefficient of racial likeness less than 5.0 represents 
a very close degree of resemblance. Values of that order have only been found between 
cranial series which would be expected, on account of their provenance, to represent the 
same or closely related populations. There is a danger that low reduced coefficients may be 
misleading owing to the influence on them of extraneous factors, such as inaccuracy in 
sexing or slight and unappreciated differences between the methods of measurement of 
two recorders working independently. It is safe to suppose that the two new series are 
made up entirely of the crania of adult males. In computing coefficients with them care 
was taken to restrict a particular comparison to pairs of means based on measurements 
obtained by following precisely the same technique. 

Owing partly to that restriction, the numbers of characters that could be used in 
computing coefficients with the new series are decidedly smaller than the 31 used ideally for 
the purpose. For these comparisons the smallest number of characters used is 9 and the 
largest number is 18.* Risdon (1939, pp. 131-2) has examined the matter experimentally 
and he concluded that use of a smaller number of characters, the set of 14 he considered 
being very similar to the sets we were able to use, can usually be expected to give a fairly 
close approximation to the result which would be obtained from about twice as many 


Table 3. Coefficients of racial likeness between ancient Egyptian, a Palestinian (Lachish) 
and modern series of male skulls from Abyssinia and Crete* 

Series                                                                          Crude C.R.L. ± P.E.    Reduced C.R.L. 

Sakkara, 1st dyn. (32.1) with Deshasheh and Medum, 4th and 5th dyn. (40.0)       0.19 ± 0.32 (9)        - 
                  (32.1) with Abydos, 18th dyn. (49.0)                          -0.08 ± 0.32 (9)        - 
                  (31.6) with Lachish (249.3)                                    1.04 ± 0.25 (14)       1.85 ± 0.45 
                  (31.6) with Gizeh, 26th-30th dyn. (885.7)                      2.45 ± 0.25 (14)       4.02 ± 0.41 
                  (31.6) with modern Abyssinian (61.4)                           2.43 ± 0.25 (14)       5.82 ± 0.60 
                  (31.6) with Abydos, 1st dyn. royal tombs (33.6)                1.91 ± 0.25 (14)       5.87 ± 0.77 
                  (30.3) with Thebes, 11th dyn. (46.7)                           4.26 ± 0.22 (18)      11.45 ± 0.59 

Thebes, 11th dyn. (49.2) with Sedment, 9th dyn. (37.9)                          -0.43 ± 0.28 (12)       - 
                  (49.0) with modern Cretans (50.4)                              2.01 ± 0.30 (10)       4.04 ± 0.60 
                  (48.3) with Deshasheh and Medum, 4th and 5th dyn. (46.0)       2.23 ± 0.32 (9)        4.73 ± 0.68 

Sedment, 9th dyn. (37.5) with Deshasheh and Medum, 4th and 5th dyn. (39.9)       1.88 ± 0.25 (14)       4.86 ± 0.65 
                  (37.7) with modern Cretans (47.9)                              2.71 ± 0.25 (15)       6.42 ± 0.59 


* The numbers in brackets following the names of the series are the mean numbers of crania for the characters 
used in computing the coefficients. The numbers in brackets following the crude coefficients are the numbers 
of characters on which they are based. Woo (1930) gives coefficients with two of the series in the table above, 
and the values there differ from his because they were recalculated omitting the term \ hn, which was discarded 
after 1930. The standard deviations of the long E series of 26th-30th dynasty crania from Gizeh (Davin & 
Pearson, 1924) were used in computing all the coefficients in the table. 


characters. Occasionally, however, use of a smaller number of characters may suggest 
a rather misleading conclusion, and it will tend to indicate a rather wider separation of the 
series than that which would be found if all 31 characters could be used. With these 
reservations in mind the coefficients with the new series may be accepted as the best 
approximations it is possible to obtain in the circumstances. 

All the coefficients of racial likeness found with the Sakkara 1st dynasty and Thebes 
11th dynasty series are given in Table 3. Fig. 1 is a reproduction of part of a diagram given 
by Risdon (1939, p. 137) for the twenty-two ancient Egyptian and related series with which 
he dealt, with the addition of the two series described in the present paper and that of 
modern Cretans. The Sakkara series is seen to be an unexceptional member of the ‘Lower 
Egyptian’ constellation, having two insignificant coefficients and other close connexions 

* The characters common to all the comparisons with the new series are L, B, LB, J, NB, 100 B/L, 
100 H'/L and 100 B/H'. Others used in some cases are U, S, fml, fmb and 100 fmb/fml, and for the coefficient 
between the two new series only G'H, NH, L, O, 100 G'H/GB, 100 NB/NH, L, 100 O2/O1. 

with members of that group. On the other hand, the 11th dynasty soldiers from Thebes 
clearly represent an Egyptian population of an aberrant type. The direct comparison fails 
to distinguish this from that of the 9th dynasty series from Sedment. Woo (1930) had 
found that the latter stands apart from all the other Egyptian series, and he pointed out 
that the Sedment bears a closer resemblance to a series of modern Cretans (von Luschan, 
1913) than to most of the ancient Egyptian series. The Thebes 11th dynasty series has 
a reduced coefficient less than 5.0 with the Cretan, though the latter has no other coefficient 
of this order with any of the other Egyptian series. 

6. The racial history of ancient Egyptian populations. The new evidence makes rather 
more precise the racial classification of ancient Egyptian populations given in earlier 
craniological papers in Biometrika. The 1st dynasty series of crania from Sakkara is the 
earliest in date that has been described representing the region immediately south of the 
Delta. It is an unexceptional representative of that group which must have prevailed 

Fig. 1. Reduced coefficients of racial likeness between the two new and other ancient 
Egyptian and related series of male crania. (Key to connecting lines: insignificant; 
significant and < 3.5; 3.5-5.0.) 


in the region—with only slight local and secular variants—from the 1st to the 30th 
dynasty and probably in both earlier and later periods as well. Such populations are said 
to be of ‘Lower Egyptian’ type. 

In the region to the south, round Thebes and Abydos, the population was of a second 
racial type from the earliest predynastic (Badari) epoch for which there is any adequate 
craniological evidence. This is called the ‘Upper Egyptian’, though it would be better 
to call it Southern Egyptian. The population became modified slowly down to some 
time about the 18th dynasty. The change was such that the ‘Upper Egyptian’ type 
of population came to bear a closer and closer resemblance to the ‘Lower Egyptian’, 
though the two groups remained clearly distinct. About the 18th dynasty there must have 
been a fairly rapid, if not abrupt, change in the racial composition of the population of the 
Thebes and Abydos region. Nearly all the series from there, of that and later dates, are not 
of ‘Upper’ but of ‘Lower Egyptian’ type. They diverge slightly from the populations of 
the region immediately south of the Delta, however, in the direction of the ‘Upper 
Egyptian’ type. Six of these series—viz. those from Abydos, Thebes and Denderah of 









dates ranging from the 18th dynasty to Roman times—are shown in Fig. 1. The 1st dynasty 
series from royal tombs at Abydos, also shown there, is an exception on account of its date. 
The obvious explanation of its peculiar position is that it represents an intrusive and more 
or less isolated community which was derived from the other centre of population to the 
north. 

This accounts for twenty-two of the twenty-four series of crania considered. The 
classification of these does not seem to necessitate reference to any non-Egyptian peoples. 
This is not so, however, in the case of the remaining two series, viz. the new one of 11th 
dynasty soldiers from Thebes and the 9th dynasty series from Sedment (Deltaic region). 
These two might represent the same population as far as can be seen from the direct 
comparison, and both stand apart from the ‘Lower Egyptian’ constellation of series (see 
Fig. 1). The fact that the 11th dynasty series from Thebes has a close resemblance to one 
of Cretans, which is of modern date, suggests that the two aberrant communities in question 
may have been derived from the crossing of ancient Egyptians with people from some 
European or Asiatic source. 

The mean basio-bregmatic heights (H'), cephalic indices and height-length indices are 
higher for the 11th dynasty Thebes and Sedment series than for any other of the series 
considered. The types, defined by average measurements, of these two thus diverge from 
that prevailing in ancient Egypt in the direction of the ‘Armenoid’ type. Elliot Smith 
(1911 and elsewhere) supposed that intrusive ‘Armenoid’ aliens played a considerable 
part in modifying the population of the country and that ‘long before the time of the 
New Empire, Egypt was permeated from one end to the other with this foreign element’. 

Our interpretation of the evidence fails entirely to support this hypothesis. There is no 
need to suppose that any people foreign to the country played a substantial part in 
modifying its population from predynastic to Roman times. The communities represented 
by the 11th dynasty Thebes and Sedment series may possibly have been derived from the 
crossing of Egyptian and ‘Armenoid’ people, but they stand apart. The remarkable point 
is not that two out of twenty-four populations should be peculiar in that way, but that the 
remaining twenty-two show interrelationships which do not suggest any admixture with 
alien stock. They can readily be explained on the supposition that there was a steady 
transference of population from the Deltaic region to the region of Thebes and Abydos, 
where the population was originally of a somewhat different type, from early predynastic 
times to the 18th dynasty. About that time the movement must have been accelerated, 
and thereafter the populations of the two centres were almost indistinguishable in racial 
type. The racial history of ancient Egypt was of a simple kind. 

7. Summary and conclusions. This paper deals with forty-four male crania of 1st dynasty 
date from Sakkara and with fifty-five crania of 11th dynasty soldiers from Thebes. 
Individual measurements taken by Prof. D. E. Derry are given in appended tables. Judging 
from the rather small samples, the two populations represented exhibited the same order 
of variation, while both were rather less mixed in racial composition than the population 
of Giza from the 26th-30th dynasties. Mean measurements clearly differentiate the two 
new series from one another. Judging from characters considered singly, both series bear 
a close resemblance to some other ancient Egyptian series, and both are of ‘Lower’ rather 
than ‘Upper Egyptian’ type. Comparisons are made by the method of the coefficient of 
racial likeness, though decidedly fewer characters than the standard set of thirty-one used 
when possible are available for the purpose. The resulting relationships are shown in Fig. 1. 

The Sakkara 1st dynasty series, which is the earliest from the region immediately south 
of the Delta, is an unexceptional member of the ‘Lower Egyptian’ constellation, and it can 
be supposed to typify the population of Northern Egypt at the time. The 11th dynasty 
series of soldiers from Thebes is linked to the same group, but it diverges from it. The 
type is indistinguishable from that of a 9th dynasty series from Sedment. The former also 
has a link with the type of a series of modern Cretans. The two aberrant communities of 
Thebes and Sedment must be supposed to have been derived from the crossing of ancient 
Egyptians with people from some European or Asiatic source. Our knowledge of the racial 
history of ancient Egypt derived from craniological evidence is reviewed. 
Appendix I. Definitions of measurements 

Individual measurements of the two series of crania are given in Appendices II-III. The 
contractions used there and in tables in the text to denote characters are: 

L = maximum glabella-occipital length. B = maximum horizontal breadth. B' = minimum 
frontal breadth. H' = basio-bregmatic height. Aur. ht. = ‘vertical height from line 
joining highest points of external auditory meatuses’. LB = basion to nasion. U = maximum 
horizontal circumference above the superciliary ridges. S1 = arc nasion to bregma. 
S2 = arc bregma to lambda. S3 = arc lambda to opisthion. S = total sagittal arc from 
nasion to opisthion. Broca’s Q' = transverse arc from ‘the most prominent point on the 
posterior root of the left zygoma, exactly above the auditory aperture’, to the same point 
on the right passing through the bregma. fml = basion to opisthion. fmb = maximum 
breadth of foramen magnum. G'H = nasion to alveolar point. GB = facial breadth between 
lowest points on zygomatico-maxillary sutures. J = maximum breadth between zygomatic 
arches. NH, L = nasal height from nasion to point furthest removed from it on the 
margin of the left pyriform aperture. NB = maximum breadth of the pyriform aperture. 
O1 = breadth of right orbit from the dacryon. O2 = maximum height of right orbit. 
Prosthion GL = basion to prosthion. 
THE GENERALIZATION OF ‘STUDENT’S’ PROBLEM 
WHEN SEVERAL DIFFERENT POPULATION 
VARIANCES ARE INVOLVED 


By B. L. WELCH, B.A., Ph.D. 


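1. Introduction and summary. Let $\eta$ be a population parameter which is estimated by an 
observed quantity $y$, normally distributed with variance $\sigma_y^2$. Let 
$\sigma_y^2 = \sum_{i=1}^{k} \lambda_i\sigma_i^2$, where the $\lambda_i$ are known positive numbers and the 
$\sigma_i^2$ are unknown variances. Suppose that the observed data provide estimates $s_i^2$ of these 
variances, based on $f_i$ degrees of freedom, respectively, so that the sampling distribution 
of $s_i^2$ is 

$$p(s_i^2)\,ds_i^2 = \frac{1}{\Gamma(\tfrac12 f_i)}\left(\frac{f_i s_i^2}{2\sigma_i^2}\right)^{\frac12 f_i - 1}\exp\left(-\frac{f_i s_i^2}{2\sigma_i^2}\right) d\left(\frac{f_i s_i^2}{2\sigma_i^2}\right), \tag{1}$$

and that these estimates are distributed independently of each other and of $y$. 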

A very simple particular case of this set-up occurs when we have samples of $n_1$ and $n_2$, 
respectively, from two normal populations with true means $\alpha_1$ and $\alpha_2$ and standard 
deviations $\sigma_1$ and $\sigma_2$. If $\eta$ is the true difference $(\alpha_1 - \alpha_2)$ between the means, the 
estimated difference is $y = (\bar{x}_1 - \bar{x}_2)$. The variance of the estimate is 
$\sigma_y^2 = (\lambda_1\sigma_1^2 + \lambda_2\sigma_2^2)$, where $\lambda_1 = 1/n_1$ and $\lambda_2 = 1/n_2$. The estimated values of 
$\sigma_1^2$ and $\sigma_2^2$ are $s_1^2 = \Sigma_1/f_1$ and $s_2^2 = \Sigma_2/f_2$, where $\Sigma_1$ and $\Sigma_2$ 
are the respective sums of squares of observations from the individual sample means and 
$f_1 = (n_1 - 1)$ and $f_2 = (n_2 - 1)$. These are distributed in the form (1) and the postulated 
conditions of independence hold. 
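
As a minimal sketch of the two-sample set-up just described, the following Python function maps a pair of samples onto the quantities $y$, $\lambda_i$, $s_i^2$ and $f_i$. The function name and the data are hypothetical and serve only as an illustration; only the definitions themselves come from the text.

```python
from statistics import mean, variance


def two_sample_quantities(x1, x2):
    """Map two samples onto the paper's notation for the case k = 2."""
    n1, n2 = len(x1), len(x2)
    y = mean(x1) - mean(x2)                  # estimated difference of the means
    lam = (1.0 / n1, 1.0 / n2)               # lambda_i = 1/n_i
    s2 = (variance(x1), variance(x2))        # s_i^2 = (sum of squares about the mean)/f_i
    f = (n1 - 1, n2 - 1)                     # degrees of freedom f_i = n_i - 1
    var_y = lam[0] * s2[0] + lam[1] * s2[1]  # estimate of lambda_1*sigma_1^2 + lambda_2*sigma_2^2
    return y, lam, s2, f, var_y


# Hypothetical data, for illustration only.
y, lam, s2, f, var_y = two_sample_quantities([1.0, 2.0, 3.0, 4.0], [2.0, 4.0, 6.0])
```

Here `statistics.variance` divides by $n_i - 1$, which agrees with the definition $s_i^2 = \Sigma_i/f_i$.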

Another particular case, again with $k = 2$, arises when we wish to compare two regression 
coefficients, fitted to independent sets of data, without making the assumption that the 
population residual variance about the true regression line is the same for both sets. 

The present paper is written mainly with these practical applications of the case $k = 2$ 
in mind, but the results are expressed generally for any $k$, since no further analytical 
difficulties are involved. It will be shown how probability statements about $y$, considered as 
an estimate of $\eta$, may be made similar in character to those which W. S. Gosset derived for 
the mean of a single sample of $n$ observations (‘Student’, 1908). We shall, in effect, seek a 
quantity $h$, calculable from the observations, with the property that the chance of the 
difference $(y - \eta)$ falling short of $h$ is a given probability $P$. It is clear that $h$ must be a 
function of the individual variances $s_i^2$ and of $P$. If the abbreviation Pr. is used to mean 
‘the probability of the relation in the bracket following’, our problem is to satisfy the 
equation 

$$\mathrm{Pr.}\,[(y - \eta) < h(s_1^2, s_2^2, \ldots, s_k^2, P)] = P. \tag{2}$$

In Gosset’s case the solution was, of course, simply 

$$\mathrm{Pr.}\,[(\bar{x} - \alpha) < t_P\, s/\sqrt{n}] = P, \tag{3}$$

where $t_P$ is the value, corresponding to the probability level $P$, in the ‘Student’ $t$-distribution 
with $f = (n - 1)$ degrees of freedom. 

In the next section the mathematical derivation of the exact solution of (2) is given. This 
is then followed by some consideration of its expression in numerical terms. First, a series 
solution in powers of $1/f_i$ is developed, which may be used for calculating tables. Then some 
comparisons are made with a non-series approximate solution which is based on a particular 
way of regarding the distribution of a quantity of the general form $z = \Sigma\lambda_i s_i^2$. 

Some brief discussion is then added which may serve to place the present contribution 
in its proper relationship to other papers which have been written on this topic. 

Finally, it is shown how the inequality (2) may be adapted to provide an interval estimate 
for $\eta$. 


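2. Mathematical derivation of solution. Let $J(s_1^2, s_2^2, \ldots, s_k^2, P)$ denote the probability that 
$(y - \eta)$ is less than $h(s_1^2, s_2^2, \ldots, s_k^2, P)$, given $s_i^2$, $i = 1, 2, \ldots, k$. Then, since $y$ is 
distributed quite independently of the estimated variances, we have 

$$J(s_1^2, s_2^2, \ldots, s_k^2, P) = \int_{-\infty}^{h/\sqrt{(\Sigma\lambda_i\sigma_i^2)}} \frac{1}{\sqrt{(2\pi)}}\, e^{-\frac12 u^2}\, du = I\left(\frac{h(s_1^2, \ldots, s_k^2, P)}{\sqrt{(\Sigma\lambda_i\sigma_i^2)}}\right), \tag{4}$$

where $I$ is used to denote the normal probability integral. The condition of equation (2) is 
then simply that, if $J(s_1^2, \ldots, s_k^2, P)$ is averaged over the probability distributions of $s_i^2$ as 
given by (1), the result will equal $P$. Thus 

$$\int\!\cdots\!\int J(s_1^2, \ldots, s_k^2, P) \prod_i \{p(s_i^2)\, ds_i^2\} = P. \tag{5}$$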


Now we may expand $J(s_1^2, s_2^2, \ldots, s_k^2, P)$ about an origin $(\sigma_1^2, \sigma_2^2, \ldots, \sigma_k^2)$ in a Taylor 
expansion. Thus 

$$J(s_1^2, \ldots, s_k^2, P) = \exp\left[\Sigma(s_i^2 - \sigma_i^2)\,\partial_i\right] J(w_1, \ldots, w_k, P), \tag{6}$$

it being understood that the exponential is to be expanded in a power series in $\partial_i$ and that 
$\partial_i^r$ is to be interpreted so that 

$$\partial_i^r J(w_1, \ldots, w_k, P) = \left[\frac{\partial^r J(w_1, w_2, \ldots, w_k, P)}{\partial w_i^r}\right]_{w_j = \sigma_j^2}. \tag{7}$$

On making the substitution of (6) into (5) our result may be written 

$$\Theta\, J(w_1, \ldots, w_k, P) = P, \tag{8}$$

where 

$$\Theta = \prod_i \int \exp\left[(s_i^2 - \sigma_i^2)\,\partial_i\right] p(s_i^2)\, ds_i^2. \tag{9}$$

Now, substituting into (9) from (1), the integral comes out in simple form, i.e. 

$$\Theta = \prod_i \left\{\left(1 - \frac{2\sigma_i^2\partial_i}{f_i}\right)^{-\frac12 f_i} \exp(-\sigma_i^2\partial_i)\right\}. \tag{10}$$

Substituting (4) into (8) we have finally 

$$\Theta\, I\left(\frac{h(w_1, \ldots, w_k, P)}{\sqrt{(\Sigma\lambda_i\sigma_i^2)}}\right) = P. \tag{11}$$

This, in a very condensed form, is the solution to our problem.* The operator $\Theta$ constitutes 
a direction to carry out the partial differentiations indicated by (10). $w_i$ must then be equated 
to $\sigma_i^2$. The solution of the resulting equation will give $h(\sigma_1^2, \sigma_2^2, \ldots, \sigma_k^2, P)$ and therefore the 
required $h(s_1^2, s_2^2, \ldots, s_k^2, P)$. 

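3. The development of the series solution. It will be convenient to write $h(w)$ for 
$h(w_1, w_2, \ldots, w_k, P)$ and $\xi$ for the normal deviate such that $I(\xi) = P$. We may then expand 
$I\{h(w)/\sqrt{(\Sigma\lambda_i\sigma_i^2)}\}$ in a Taylor series about $\xi$ as origin. Thus 

$$I\left\{\frac{h(w)}{\sqrt{(\Sigma\lambda_i\sigma_i^2)}}\right\} = \exp\left[\left\{\frac{h(w)}{\sqrt{(\Sigma\lambda_i\sigma_i^2)}} - \xi\right\} D\right] I(v), \tag{12}$$

it being understood that the exponential is to be expanded in powers of $D$, and that these 
powers are to be interpreted so that 

$$D^r I(v) = \left[\frac{d^r I(v)}{dv^r}\right]_{v = \xi}. \tag{13}$$

Equation (11) then becomes 

$$\Theta \exp\left[\left\{\frac{h(w)}{\sqrt{(\Sigma\lambda_i\sigma_i^2)}} - \xi\right\} D\right] I(v) = I(\xi). \tag{14}$$

This may now be solved by successive approximations. 

The initial approximation is the large-sample normal approximation 

$$h_0(w) = \xi\sqrt{(\Sigma\lambda_i w_i)} \tag{15}$$

and we may write 

$$h(w) = \xi\sqrt{(\Sigma\lambda_i w_i)} + h_1(w) + h_2(w) + \text{etc.}, \tag{16}$$

where $h_1(w)$ includes terms of order $1/f_i$, $h_2(w)$ terms of order $1/f_i^2$ and so on. For the moment 
we shall treat terms of the order $1/f_i^2$ as negligible. Then (14) gives 

$$\Theta \exp\left[\left\{\frac{\xi\sqrt{(\Sigma\lambda_i w_i)}}{\sqrt{(\Sigma\lambda_i\sigma_i^2)}} - \xi\right\} D\right] \exp\left[\frac{h_1(w)\,D}{\sqrt{(\Sigma\lambda_i\sigma_i^2)}}\right] I(v) = I(\xi), \tag{17}$$

$$\text{i.e.}\quad \Theta \exp\left[\left\{\frac{\xi\sqrt{(\Sigma\lambda_i w_i)}}{\sqrt{(\Sigma\lambda_i\sigma_i^2)}} - \xi\right\} D\right] \left\{1 + \frac{h_1(w)\,D}{\sqrt{(\Sigma\lambda_i\sigma_i^2)}}\right\} I(v) = I(\xi). \tag{18}$$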

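Or, using (10), 

$$\left[\frac{h_1(\sigma^2)\,D}{\sqrt{(\Sigma\lambda_i\sigma_i^2)}} + \frac{(\xi^2 D^2 - \xi D)}{4}\,\frac{\Sigma(\lambda_i^2\sigma_i^4/f_i)}{(\Sigma\lambda_i\sigma_i^2)^2} + \ldots\right] I(v) = 0. \tag{19}$$

The equation of the first order term to zero gives 

$$h_1(\sigma^2) = \frac{\xi(1 + \xi^2)}{4}\,\frac{\Sigma(\lambda_i^2\sigma_i^4/f_i)}{(\Sigma\lambda_i\sigma_i^2)^{3/2}}. \tag{20}$$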


* Equation (11) can also be expressed as an integral equation and this form may be necessary for 
providing numerical values where the $f_i$ are very small. 

This can then be substituted in the second-order term which, when equated to zero, will 
give $h_2(\sigma^2)$. The process may obviously be extended to higher orders, although the expressions 
become so complex that a slightly different procedure has then been found to be preferable. 
To terms of order $1/f_i^2$ our solution is 

$$h(s^2) = \xi\sqrt{(\Sigma\lambda_i s_i^2)}\left[1 + \frac{(1+\xi^2)}{4}\,\frac{\Sigma(\lambda_i^2 s_i^4/f_i)}{(\Sigma\lambda_i s_i^2)^2} - \frac{(1+\xi^2)}{2}\,\frac{\Sigma(\lambda_i^2 s_i^4/f_i^2)}{(\Sigma\lambda_i s_i^2)^2} + \frac{(3+5\xi^2+\xi^4)}{3}\,\frac{\Sigma(\lambda_i^3 s_i^6/f_i^2)}{(\Sigma\lambda_i s_i^2)^3} - \frac{(15+32\xi^2+9\xi^4)}{32}\,\frac{\{\Sigma(\lambda_i^2 s_i^4/f_i)\}^2}{(\Sigma\lambda_i s_i^2)^4}\right]. \tag{21}$$

It may be noted that in the particular case $k = 1$, this reduces, as it should, to the already 
known expansion of the deviate of the straightforward ‘Student’ distribution (Fisher, 
1941, p. 151), viz. 

$$\xi\sqrt{(\lambda s^2)}\left[1 + \frac{(1+\xi^2)}{4f} + \frac{(3+16\xi^2+5\xi^4)}{96f^2} + \text{etc.}\right]. \tag{22}$$

It is proposed in another communication to give tables of $h(s^2)$ based on the expansion 
(21) carried to some further terms. 
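
The series lends itself to direct numerical evaluation. The sketch below assumes the reading of (21) in which the square bracket contains, after the leading 1, the first-order term with factor $(1+\xi^2)/4$ and three second-order terms with factors $-(1+\xi^2)/2$, $(3+5\xi^2+\xi^4)/3$ and $-(15+32\xi^2+9\xi^4)/32$; with $k = 1$ this reading reproduces (22). The function names are hypothetical, and the code is an illustration rather than the tabulation procedure of the paper.

```python
from math import sqrt
from statistics import NormalDist


def welch_series_h(lam, s2, f, P):
    """Series value of h(s^2, P) to terms of order 1/f_i^2, under the assumed reading of (21)."""
    xi = NormalDist().inv_cdf(P)            # normal deviate with I(xi) = P
    x2 = xi * xi
    t = [l * s for l, s in zip(lam, s2)]    # lambda_i * s_i^2
    A = sum(t)
    B1 = sum(ti ** 2 / fi for ti, fi in zip(t, f))
    B2 = sum(ti ** 2 / fi ** 2 for ti, fi in zip(t, f))
    C = sum(ti ** 3 / fi ** 2 for ti, fi in zip(t, f))
    bracket = (1.0
               + (1 + x2) / 4 * B1 / A ** 2
               - (1 + x2) / 2 * B2 / A ** 2
               + (3 + 5 * x2 + x2 ** 2) / 3 * C / A ** 3
               - (15 + 32 * x2 + 9 * x2 ** 2) / 32 * B1 ** 2 / A ** 4)
    return xi * sqrt(A) * bracket


def student_series(f, P):
    """Expansion (22) of the 'Student' deviate for unit lambda*s^2, to order 1/f^2."""
    xi = NormalDist().inv_cdf(P)
    x2 = xi * xi
    return xi * (1 + (1 + x2) / (4 * f) + (3 + 16 * x2 + 5 * x2 ** 2) / (96 * f ** 2))
```

For a single variance with 10 degrees of freedom and $P = 0.95$ the two functions agree, and both lie close to the tabulated ‘Student’ value of about 1.8125.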

4. Discussion of a non-series approximation. It will be recalled that in Gosset’s original 
approach to the single sample problem (‘Student’, 1908) his initial step was to note that the 
first four moments of the distribution of $s^2$ were consistent with the assumption that the 
distribution could be represented by a Pearson Type III curve. He was fortunate in this 
way to rediscover a distribution which had already been found by Helmert, as this permitted 
him to go on to the derivation of the $t$-distribution. In our present case, as in many others 
arising naturally in statistical work, we are led to consider, instead of a single $s^2$, a linear 
function of several $s_i^2$. If this linear function were distributed in a Pearson Type III distribution 
a whole range of new problems could be dealt with by well-established theory. However, in 
general, we do not have this good fortune. For $\Sigma\lambda_i s_i^2$ is of the form $\Sigma a_i\chi_i^2$, where 
$a_i = \lambda_i\sigma_i^2/f_i$, and the distribution of this quantity is only of Type III if all the $a_i$, except one, 
are zero, or if all the $a_i$ happen to be equal. 

Nevertheless, for practical purposes an approximation to the distribution of $\Sigma\lambda_i s_i^2$, using 
a Type III curve with start, mean and variance suitably adjusted, can still be useful. In two 
previous papers (Welch, 1936, 1938) I have employed this method to obtain numerical 
comparisons of the merits of different statistical procedures, where full calculations with the 
true distributions would have been unduly laborious. The method of determining the 
constants in the approximation was given for the case $k = 2$ in the first of these papers and is 
as follows. 

If $z = a\chi_1^2 + b\chi_2^2$, where $\chi_1^2$ and $\chi_2^2$ have $f_1$ and $f_2$ degrees of freedom, and the 
approximate distribution curve is written in the form 

$$p(z)\,dz = \frac{1}{\Gamma(\tfrac12 f)}\left(\frac{z}{2g}\right)^{\frac12 f - 1}\exp\left(-\frac{z}{2g}\right) d\left(\frac{z}{2g}\right), \tag{23}$$

then making the first two moments of (23) agree with the true moments of $z$, we find 

$$f = \frac{(af_1 + bf_2)^2}{a^2 f_1 + b^2 f_2}, \qquad g = \frac{a^2 f_1 + b^2 f_2}{af_1 + bf_2}. \tag{24}$$

Phrasing the matter rather differently, we can say that $z/g$ is approximately distributed as 
$\chi^2$ with degrees of freedom $f$. Of course $f$, given by (24), will in general be fractional, but the 
letter used to designate this quantity was chosen, and the term ‘effective degrees of freedom’ 
has been used, because by doing so we can appeal immediately to a considerable body of 
further theoretical results. 
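
The moment matching can be sketched in a few lines of Python. The sketch assumes only the standard facts that a $\chi^2$ variable with $f$ degrees of freedom has mean $f$ and variance $2f$, so that $g\chi^2$ with $f$ degrees of freedom has mean $gf$ and variance $2g^2f$; the function name is hypothetical.

```python
def type_iii_constants(a, b, f1, f2):
    """Match the first two moments of z = a*chi2(f1) + b*chi2(f2) to g*chi2(f)."""
    mean_z = a * f1 + b * f2                 # E[z], since E[chi2(f)] = f
    var_z = 2 * (a * a * f1 + b * b * f2)    # Var[z], since Var[chi2(f)] = 2f
    g = var_z / (2 * mean_z)                 # from g*f = mean_z and 2*g^2*f = var_z
    f = mean_z / g                           # effective degrees of freedom
    return f, g


# Equal coefficients: the distribution is exactly chi-square and f = f1 + f2.
f_equal, g_equal = type_iii_constants(1.0, 1.0, 3, 5)
# Unequal coefficients: f is in general fractional.
f_frac, g_frac = type_iii_constants(2.0, 1.0, 4, 6)
```

With equal coefficients the function returns $f = f_1 + f_2$ and $g$ equal to the common coefficient, which is the exact case noted in the text.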

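In particular we can say that the criterion 

$$v = \frac{(y - \eta)}{\sqrt{(\lambda_1 s_1^2 + \lambda_2 s_2^2)}} \tag{25}$$

follows approximately the ‘Student’ $t$-distribution with degrees of freedom 

$$f = \frac{(\lambda_1\sigma_1^2 + \lambda_2\sigma_2^2)^2}{\dfrac{\lambda_1^2\sigma_1^4}{f_1} + \dfrac{\lambda_2^2\sigma_2^4}{f_2}}. \tag{26}$$

More generally, when $k$ is not restricted to 2, the same line of argument leads us to say 
that the criterion 

$$v = \frac{(y - \eta)}{\sqrt{(\Sigma\lambda_i s_i^2)}} \tag{27}$$

is approximately distributed as ‘Student’s’ $t$ with degrees of freedom 

$$f = \frac{(\Sigma\lambda_i\sigma_i^2)^2}{\Sigma(\lambda_i^2\sigma_i^4/f_i)}. \tag{28}$$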

Not knowing the $\sigma_i^2$’s in (28), there are several ways in which we may now proceed, depending 
on what weight we may be willing to attach to any vague a priori notions we may possess 
of their relative magnitudes (cf. Welch, 1938). If we are not willing to assume anything, 
perhaps the best choice is 

$$f = \frac{(\Sigma\lambda_i s_i^2)^2 - 2\,\Sigma\left\{\dfrac{\lambda_i^2 s_i^4}{f_i + 2}\right\}}{\Sigma\left\{\dfrac{\lambda_i^2 s_i^4}{f_i + 2}\right\}}. \tag{29}$$

It may be shown that the numerator of (29) has, in repeated samples, an average value 
$(\Sigma\lambda_i\sigma_i^2)^2$, and the denominator has average value $\Sigma(\lambda_i^2\sigma_i^4/f_i)$. In a certain sense, therefore, 
(29) is a fair estimate of (28). 

To sum up, then, the interpretation of $y$ as an estimate of $\eta$, using the present type of 
approximation, involves only the reference of the criterion (27) to tables of the ‘Student’ 
distribution, entered with degrees of freedom given by (29). 
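
A minimal Python sketch of this recipe follows. It assumes that (29) is read as $[(\Sigma\lambda_i s_i^2)^2 - 2\Sigma\{\lambda_i^2 s_i^4/(f_i+2)\}]/\Sigma\{\lambda_i^2 s_i^4/(f_i+2)\}$ and that the criterion is $(y-\eta)/\sqrt{(\Sigma\lambda_i s_i^2)}$; the function names are hypothetical, and the final look-up in tables of the ‘Student’ distribution is left to the reader.

```python
from math import sqrt


def effective_df(lam, s2, f):
    """Estimated effective degrees of freedom, under the assumed reading of (29)."""
    t = [l * s for l, s in zip(lam, s2)]                    # lambda_i * s_i^2
    den = sum(ti ** 2 / (fi + 2) for ti, fi in zip(t, f))   # estimates sum(lambda_i^2 sigma_i^4 / f_i)
    num = sum(t) ** 2 - 2 * den                             # estimates (sum(lambda_i sigma_i^2))^2
    return num / den


def criterion_v(y, eta, lam, s2):
    """Criterion (y - eta) / sqrt(sum(lambda_i * s_i^2))."""
    return (y - eta) / sqrt(sum(l * s for l, s in zip(lam, s2)))


# With a single variance the formula returns the ordinary degrees of freedom.
f_single = effective_df([1.0], [2.0], [7])
```

The single-variance check is a useful sanity test: whatever the value of $s^2$, the formula returns $f_1$, as the exact ‘Student’ theory requires.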

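Some further light is now thrown on this procedure by the expansion for the exact solution 
of our problem derived in the preceding section. For the implications of referring $v$ to the 
‘Student’ distribution may be seen by substituting $f$ from (29) into the expansion (22) of 
the ‘Student’ deviate. On doing this and then expanding in powers of $1/f_i$ it is found that, 
in effect, our approximation corresponds to assuming that 

$$h(s^2) = \xi\sqrt{(\Sigma\lambda_i s_i^2)}\left[1 + \frac{(1+\xi^2)}{4}\,\frac{\Sigma(\lambda_i^2 s_i^4/f_i)}{(\Sigma\lambda_i s_i^2)^2} - \frac{(1+\xi^2)}{2}\,\frac{\Sigma(\lambda_i^2 s_i^4/f_i^2)}{(\Sigma\lambda_i s_i^2)^2} + \frac{(51+64\xi^2+5\xi^4)}{96}\,\frac{\{\Sigma(\lambda_i^2 s_i^4/f_i)\}^2}{(\Sigma\lambda_i s_i^2)^4} + \ldots\right], \tag{30}$$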

whereas, in fact, the true solution is given by (21). Comparison shows that we have exact 
agreement to terms of order $1/f_i$ and in the first of the quadratic terms. To the second order 
the difference between the expressions in square brackets in equations (21) and (30) is 

$$\frac{(3 + 5\xi^2 + \xi^4)}{3}\left[\frac{\Sigma(\lambda_i^3 s_i^6/f_i^2)}{(\Sigma\lambda_i s_i^2)^3} - \frac{\{\Sigma(\lambda_i^2 s_i^4/f_i)\}^2}{(\Sigma\lambda_i s_i^2)^4}\right]. \tag{31}$$

This difference vanishes if any one of the $\lambda_i s_i^2$ is overwhelmingly larger than all the others, or 
if $s_i^2$ is proportional to $f_i/\lambda_i$. It appears that, in general, the difference is not likely to be large. 
We have, therefore, found some justification for using the Type III approximation in the 
present case. 
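
The two vanishing conditions can be checked numerically. The sketch below assumes that the second-order terms of the exact series (21) beyond the shared term are $(3+5\xi^2+\xi^4)/3 \cdot \Sigma(\lambda_i^3 s_i^6/f_i^2)/(\Sigma\lambda_i s_i^2)^3 - (15+32\xi^2+9\xi^4)/32 \cdot \{\Sigma(\lambda_i^2 s_i^4/f_i)\}^2/(\Sigma\lambda_i s_i^2)^4$, and that the corresponding term of the approximation (30) is $(51+64\xi^2+5\xi^4)/96 \cdot \{\Sigma(\lambda_i^2 s_i^4/f_i)\}^2/(\Sigma\lambda_i s_i^2)^4$. The function name and the numerical inputs are hypothetical.

```python
from statistics import NormalDist


def second_order_gap(lam, s2, f, P):
    """Exact-minus-approximate second-order bracket terms, under the assumed readings."""
    x2 = NormalDist().inv_cdf(P) ** 2
    t = [l * s for l, s in zip(lam, s2)]                  # lambda_i * s_i^2
    A = sum(t)
    B1 = sum(ti ** 2 / fi for ti, fi in zip(t, f))
    C = sum(ti ** 3 / fi ** 2 for ti, fi in zip(t, f))
    exact = ((3 + 5 * x2 + x2 ** 2) / 3 * C / A ** 3
             - (15 + 32 * x2 + 9 * x2 ** 2) / 32 * B1 ** 2 / A ** 4)
    approx = (51 + 64 * x2 + 5 * x2 ** 2) / 96 * B1 ** 2 / A ** 4
    return exact - approx


# s_i^2 proportional to f_i / lambda_i: here lambda_i * s_i^2 equals f_i.
gap_proportional = second_order_gap([0.5, 0.25], [8.0, 32.0], [4, 8], 0.95)
# Equal lambda_i * s_i^2 but very unequal degrees of freedom.
gap_unequal = second_order_gap([1.0, 1.0], [1.0, 1.0], [2, 20], 0.95)
```

In the proportional case the gap is zero up to rounding error, while in the second case it is positive; both outcomes agree with the factorized form of the difference.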

The above comparison has been made on the basis of the series developments, but it should 
be borne in mind that approximations based on positive frequency functions, such as those 
falling under the Pearson system, usually provide a higher degree of accuracy than might 
appear from any consideration of expansions. Furthermore, they are apt to give an insight 
into the nature of the situation which may sometimes be lost in working out the details of 
exact solutions. In the present case I feel that the comparison of this section serves to give 
added confidence in the exact solution,* which I have put forward in the previous two 
sections, quite as much as it demonstrates the value of the approximate method. 


5. Further discussion. In comparing the present contribution with other work on the 
subject, the essential point to notice is the averaging process involved in equation (5). We 
are not trying here to make probability statements valid for fixed $s_i^2$, but are averaging over 
the joint probability distribution of the $s_i^2$, taking into account, therefore, the different 
values which can arise by chance in sampling from populations with fixed $\sigma_i^2$. 

This averaging over the joint distribution of the $s_i^2$ is parallel to the step taken in 
Section III of Gosset’s original memoir (1908) where, in effect, he starts with the distribution 
of $t$ for samples with fixed $s$ and then averages over the distribution of $s$ which he has already 
derived earlier. He thus arrives at the unrestricted distribution of t (or, more strictly, of a 
quantity z, which is equal to t multiplied by a constant). This distribution forms the basis 
of the significance tests which he illustrates in his Section IX and of the method of deriving 
interval estimates for the population mean which he outlines in his Section VIII. 

In the present paper the parallelism with Gosset’s work may be obscured to some extent 
by the fact that we do not from the outset seek the probability distribution of some pivotal 
quantity like t, explicitly expressed. It so happens that we are able to proceed to a method 
of deriving an expansion for the required probability level without making explicit reference 
to such a quantity. Nevertheless there remains the important resemblance with Gosset’s 
development, in that we do not confine ourselves to samples with fixed $s_i^2$. 

This procedure stands in sharp contrast to the formulation of the problem of comparing 
two means, favoured by R. A. Fisher (e.g. 1941) and H. Jeffreys (1940). These writers 
prefer a solution which they ascribe initially to W. U. Behrens (1929). Looked at from one 
point of view, Behrens’s paper appears to contain some gross algebraical errors. Fisher and 
Jeffreys, however, develop lines of argument by means of which they claim that Behrens’s 
solution is quite justified. It seems to me difficult to say how far (if at all) any of these 
arguments may have been in Behrens’s mind when he wrote his paper and I shall not 
attempt to elucidate this question here. We may, however, permit ourselves one observation 
about the developments according to Fisher and Jeffreys. 

* Exact in the sense that it is independent of the irrelevant population parameters $\sigma_i^2$. 

Both these writers, at some stage, limit the field of their probability inferences to a 
subset in which the $s_i^2$ are regarded as fixed. In order to solve the problem on these lines Jeffreys 
introduces an a priori distribution function for the unknown $\sigma_i$, following his general 
philosophy for dealing with such questions. Fisher, on the other hand, arrives at the same 
answer by a special utilization of what he terms the fiducial distribution of $\sigma_i^2$. 

Jeffreys’s approach here does not raise any new issues to those who are familiar with the 
general body of his researches on statistical inference. Fisher’s justification of Behrens’s 
solution is perhaps of more immediate interest as it raises controversial points which are 
important more specifically in relation to our present topic of discussion. For although 
Fisher’s approach has been very much criticized by a number of writers, starting with 
M. S. Bartlett (1936), the critics have not wished to throw doubt on the whole body of 
results which Fisher includes under the heading of fiducial inference. The criticism has been 
for the most part selective, directed mainly at the way in which so-called simultaneous 
fiducial distributions of several parameters have been defined and manipulated. 

I have, myself, quite definite views on these questions (particularly on the usage of the 
word ‘fiducial’) but do not feel that I need express them at any great length here. I 
disagree with Fisher, but this divergence of opinion must already have become apparent in 
the way I have defined the field within which I make my probability inferences about $\eta$. 
It appears to me to be quite artificial to restrict our view to one which, even in a limited 
sense, fixes $s_i^2$. It is true that, in the two-sample problem, we have to draw our inferences 
from the unique pair of samples observed, or, more precisely, from the statistics $\bar{x}_1$, $\bar{x}_2$, 
$s_1^2$ and $s_2^2$ which they provide. These statistics are our only data for the purpose of making 
inferences, but we add something to these data in the interpretation when we regard the 
samples as being drawn randomly from hypothetical normal populations. Once having 
embarked on this method of interpretation, we should stick to it consistently throughout. 
The sampling variations of $s_i^2$ should be taken into account only by a direct use of the 
probability distributions as given by our equation (1) and not by any inversion such as is 
involved in Fisher’s conception of the fiducial distribution of $\sigma_i^2$. As we have seen, it is 
quite possible to make probability statements about the difference between the population 
means without making any reference whatever either to inverse probability or to fiducial 
distributions. 

The distinction between the procedure which Fisher advocates and one which averages 
over the $s_i^2$ distributions has, of course, been stressed by most of the writers who have 
contributed papers on the subject, from whatever viewpoint (e.g. Bartlett, 1936, p. 560, 
and Yates, 1939). What has been lacking hitherto, however, is a solution, analogous to 
Gosset’s single sample solution, which makes complete use of the information contained 
in the data provided. Bartlett indicated one particular way in which probability inferences 
about the difference between two population means might be made, but was careful to point 
out that the problem of making the best possible inferences (in the theoretical sense of 
utilizing all the information in the data to its full extent) was still an open one. There has 
indeed been some doubt expressed whether a fully satisfactory solution from this point of 
view existed at all. I believe, however, that the one I advance above in equation (11), 
and develop in equation (21), meets all the requirements that one can reasonably expect. 

Whatever conclusion the reader may come to on these matters, however, he will probably 
wish to know how, in the numerical details, this solution will differ from that of Behrens. 
This will be more easily seen when some tables become available, but fortunately certain 
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comparisons can already be made. For Fisher (1941, p. 156) has provided a series ex¬ 
pansion of the Behrens solution. In out’ notation, and with — 2, this may be written, to 
order 1//^, as follows: 


h{3^) = 


4 {A^8l + A^8l)^'^\A'^fJ {Ksl^X,8l)^ 


(32) 


Even to this order, this differs from our equation (21) in the inclusion of an extra term. In 
other words, although the two solutions are the same when samples are large enough to 
adopt the large-sample normal approximation, they differ immediately we take into account 
the first corrective term, i.e. they differ as soon as we begin to attach any importance to 
‘Studentization’. 


6. An interval estimate for $\eta$. We have shown in §§2 and 3 how to calculate a value 
$h(s_1^2, s_2^2, \ldots, s_k^2, P)$, depending on the observed variances $s_1^2, \ldots, s_k^2$, such that the probability 
is $P$ that $(y - \eta) < h(s_1^2, s_2^2, \ldots, s_k^2, P)$. This provides a method of testing the consistency of an 
observed $y$ with a prescribed value $\eta$. 

When the question is not whether any particular given $\eta$ is contradicted by the data, but 
rather one of estimating $\eta$ and at the same time of providing a measure of the uncertainty 
of the estimate, the further step required is immediate. For, as in the case of a single sample, 
the order of the words in our probability statement can be changed so that it becomes: the 
probability is $P$ that $\eta$ is greater than $\{y - h(s_1^2, s_2^2, \ldots, s_k^2, P)\}$. An interval estimate for $\eta$ is then 
obtained by taking two levels $P_1$ and $P_2$ for $P$. Thus the probability is $(P_1 - P_2)$ that $\eta$ lies 
between $\{y - h(s_1^2, s_2^2, \ldots, s_k^2, P_1)\}$ and $\{y - h(s_1^2, s_2^2, \ldots, s_k^2, P_2)\}$. 

If $P_2 = (1 - P_1)$ the range will be symmetrically placed about $y$. Thus, for example, if 
$P_1 = 0.95$ and $P_2 = 0.05$, the chance will be 90 % that $\eta$ lies within the range 


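$$y \pm 1.6449\,\sqrt{\Bigl(\sum \lambda_i s_i^2\Bigr)}\left[1 + \frac{1 + (1.6449)^2}{4}\,\frac{\sum (\lambda_i^2 s_i^4 / f_i)}{\bigl(\sum \lambda_i s_i^2\bigr)^2} + \text{etc.}\right]. \tag{33}$$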

It may be noted, incidentally, that this range is always narrower than similar ranges 
calculated from Behrens’s solution. 
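
The arithmetic of the range (33) may be illustrated by a short sketch. It is offered only as an illustration under stated assumptions: the function and argument names are hypothetical, the series is truncated after the first corrective term (the "etc." of (33) is ignored), and $\lambda_i$, $s_i^2$ and $f_i$ are taken to be the coefficients, observed variances and degrees of freedom of the text.

```python
import math


def interval_first_order(y, lam, s2, f, xi=1.6449):
    """Range y -/+ h from equation (33), keeping only the first corrective term.

    lam, s2 and f are equal-length sequences holding lambda_i, s_i^2 and f_i.
    xi is the normal deviate (1.6449 gives the 90 % range of the text).
    """
    a = sum(l * v for l, v in zip(lam, s2))                      # sum of lambda_i s_i^2
    b = sum((l * v) ** 2 / df for l, v, df in zip(lam, s2, f))   # sum of lambda_i^2 s_i^4 / f_i
    h = xi * math.sqrt(a) * (1.0 + (1.0 + xi ** 2) / 4.0 * b / a ** 2)
    return y - h, y + h


# Hypothetical example: difference of two sample means with n1 = n2 = 10.
low, high = interval_first_order(1.2, [0.1, 0.1], [4.0, 9.0], [9, 9])
```

With a single sample the bracket reduces to $1 + (1 + \xi^2)/4f$, which is the familiar first corrective term in the expansion of ‘Student’s’ $t$.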
THE DISTRIBUTION OF KENDALL’S τ COEFFICIENT OF RANK 
CORRELATION IN RANKINGS CONTAINING TIES 

By G. P. SILLITTO, Ph.D., B.Sc., Research Department, I.C.I. (Explosives) Ltd. 

A new coefficient of rank correlation has been described by Kendall (1938, 1942, 1943) and 
denoted by him as τ. This coefficient has advantages over Spearman’s ρ in respect of the 
smoothness of its distribution and the rapidity with which it approaches normality, thus 
facilitating significance testing, and in being readily adapted to cases of partial rank 
correlation. 

The distribution of τ has been worked out by Kendall (1938, 1943) for cases in which 
neither ranking contains members which are graded equal, i.e. rankings containing no 
‘ties’. It is the purpose of the present paper to deal with cases, which frequently arise in 
practice, in which ties occur in one of the two rankings. The method is a generalization of 
that of Kendall and will be given in some detail for the case of tied pairs, while the results 
of further generalization to multiplet ties will be indicated without detailed proof, which can 
in all cases be effected simply on the lines indicated. 

Definition of τ for rankings containing ties 
In counting the ‘score’ of a pair of rankings, by the methods suggested by Kendall, each 
member is compared with the other members of the same ranking, and additions to or sub¬ 
tractions from the score are made depending on whether it is smaller or greater in each case. 
If some members are ranked equal then it is proposed that no change be made in the score 
in comparing them. This obviously accords with the intuitive aspects of ranking. Thus in 
the pair of rankings following, the score is +8: 

1 2 3 4 5 6 

2 1 3 5 6 3 

The maximum score possible is thus obviously reduced by the presence of ties, and it is 
evident that the presence of each tied pair reduces the maximum possible score by unity, 
so that it becomes $\tfrac{1}{2}n(n-1) - p_2$ for the case of a ranking of $n$ members containing $p_2$ pairs. 
Thus for such a ranking τ would be defined as 

$$\tau = 2S / \{n(n-1) - 2p_2\},$$

where $S$ is the observed score. 

Generally, each $r$-tuplet tie reduces the maximum possible score by $\tfrac{1}{2}r(r-1)$,* so that for 
a ranking of $n$ members containing $p_2$ pairs, $p_3$ triplets, ..., $p_r$ $r$-tuplets, 

$$\tau = \frac{2S}{n(n-1) - 2p_2 - 6p_3 - \ldots - r(r-1)p_r}.$$
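
The definition of the score and of τ may be sketched as follows. This is an illustration only: the function names are hypothetical, ties are assumed to occur in the second ranking alone, and the worked example assumes that the second ranking above reads 2 1 3 5 6 3, which is the reading consistent with the stated score of +8.

```python
from collections import Counter


def kendall_score(a, b):
    """Score S: +1 for a pair in the same order in both rankings,
    -1 for a pair in opposite orders, 0 if the pair is tied in either ranking."""
    def sign(v):
        return (v > 0) - (v < 0)
    n = len(a)
    return sum(sign(a[j] - a[i]) * sign(b[j] - b[i])
               for i in range(n) for j in range(i + 1, n))


def tau_with_ties(a, b):
    """tau = 2S / {n(n-1) - 2 p_2 - 6 p_3 - ... - r(r-1) p_r}, ties counted in b."""
    n = len(a)
    tie_term = sum(r * (r - 1) for r in Counter(b).values())  # untied members give 0
    return 2 * kendall_score(a, b) / (n * (n - 1) - tie_term)


natural = [1, 2, 3, 4, 5, 6]
example = [2, 1, 3, 5, 6, 3]
S = kendall_score(natural, example)
tau = tau_with_ties(natural, example)
```

For the example the denominator is $6 \cdot 5 - 2 = 28$, so that τ is $16/28$.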

The sum of the frequencies of the possible scores 
When no ties are present, each permutation of the n members produces a possible score so 
that there are in all n ! possible scores. When ties are present they decrease the number of 
possible permutations of an assigned set of members, but, on the other hand, they give rise 

* This result has been given by Kendall (1946). 
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to further families of scores due to the different places in the ranking which can be occupied 
by the tied members. Thus, for instance, the rankings 

113456, 122456, 123356, 123446, 123455 

all give rise to the maximum score, 14. 

Considering any assigned ranking, the number of possible permutations with $p_2$ pairs 
present is $n!/2^{p_2}$, or if there are in addition $p_3$ triplets, ..., $p_r$ $r$-tuplets, $n!/\{(2!)^{p_2}(3!)^{p_3}\ldots(r!)^{p_r}\}$. 

The distribution of scores of an assigned set of ranks will be referred to as the basic 
distribution for the type of ranking concerned, since consideration of the possible ways of 
assigning the pairs, triplets, etc., among the members of the ranking has only the effect 
of multiplying the frequency of each score by a constant factor. This factor is the number of 
ways of distributing the $p_1, p_2, \ldots, p_r$ ranks among the $n$ members. This is the 
number of possible permutations 

$$\frac{(p_1 + p_2 + \ldots + p_r)!}{p_1!\,p_2!\,p_3! \ldots p_r!}.$$

Basic frequency distributions of the scores 

The basic frequency distributions can be established by an extension of the methods given 
by Kendall. Considering first the case of tied pairs, the frequency function of the basic 
distribution of the scores may be written $f(S, n, p_2)$, where $p_2$ is the number of pairs. The 
frequency generating function is then $\sum_S f(S, n, p_2)\,x^S$. Now consider the addition of another 
tied pair, with a greater ranking than any of the existing ranks. If it is added to the extreme 
left of the ranking it adds $-2n$ to the score. Moving one of the pair one place to the right 
adds 2 to this new score; bringing the other added member up to it adds another 2. Starting 
again with both the new members on the extreme left, movement of one of them two places 
to the right adds 4 to the new score; bringing the other up to it in two steps, each of one place, 
adds successively a further 2 and 4. Proceeding in this way, all possible additions to the old 
score which may be brought about by the addition of a tied pair of new members are 
represented by the array 

$$\begin{array}{cccccc}
-2n & -(2n-2) & -(2n-4) & -(2n-6) & \ldots & 0 \\
 & -(2n-4) & -(2n-6) & -(2n-8) & \ldots & +2 \\
 & & -(2n-8) & -(2n-10) & \ldots & +4 \\
 & & & -(2n-12) & \ldots & \\
 & & & & & +2n
\end{array}$$

Thus the addition of a new tied pair has the effect of multiplying the frequency generating 
function by 

$$\sum_{0 \le a \le b \le n} x^{-2n + 2a + 2b}.$$

The addition of a single new member to the ranking has the effect of multiplying the 
frequency generating function by 

$$x^{-n} + x^{-n+2} + \ldots + x^{n-2} + x^{n},$$

as shown by Kendall, the presence of tied pairs in the existing ranking having no effect. 
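
The two recurrence relations may be checked against direct enumeration in the following sketch. It is an illustration only: the function names are hypothetical, a distribution is held as a mapping from score to frequency, and the additions to the score are taken from the array above, namely $-2n + 2a + 2b$ for a tied pair with $0 \le a \le b \le n$, and $-n + 2k$ for a single member with $0 \le k \le n$, where $n$ is the number of members already present.

```python
from collections import Counter
from itertools import permutations


def add_single(dist, n):
    """Add one new highest-ranked member to a ranking of n members."""
    out = Counter()
    for s, freq in dist.items():
        for k in range(n + 1):               # k existing members lie to its left
            out[s - n + 2 * k] += freq
    return out


def add_tied_pair(dist, n):
    """Add a tied pair, ranked above all n existing members."""
    out = Counter()
    for s, freq in dist.items():
        for a in range(n + 1):               # existing members left of the first of the pair
            for b in range(a, n + 1):        # existing members left of the second
                out[s - 2 * n + 2 * a + 2 * b] += freq
    return out


def brute_force(ranks):
    """Basic distribution by enumerating every distinct arrangement of the ranks."""
    def score(seq):
        return sum((seq[j] > seq[i]) - (seq[j] < seq[i])
                   for i in range(len(seq)) for j in range(i + 1, len(seq)))
    return Counter(score(p) for p in set(permutations(ranks)))


# One member, then a second single member, then a tied pair: ranks 1, 2, 3, 3.
dist = add_tied_pair(add_single(Counter({0: 1}), 1), 2)
```

The total frequency of the result is $4!/2 = 12$, in agreement with the count of permutations given in the preceding section.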
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With these two recurrence relations there is no difficulty in drawing up a table of basic 
frequency distributions for tied pairs as exemplified in Table 1, in which only positive values 
of the score are shown, negative values being obtainable by symmetry. 


Table 1. Distribution of the score S for values of n from 3 to 7, and for rankings containing 
pairs of members ranked equal (only positive half of symmetrical distribution) 



Before the construction of the table has proceeded far, however, it becomes evident that 
there is a recurrence relation between individual frequencies for any given value of $n$, such 
that the frequency of any score $S_j$ for $p_2$ pairs is the sum of the frequencies of $S_j + 1$ and 
$S_j - 1$ for $p_2 + 1$ pairs. This obviously arises from the fact that if two members ranked equal, 
say $r$th, in a ranking with $p_2 + 1$ pairs are subsequently distinguished and given rankings $r$ 
and $r + 1$, this will increase the score by unity if the $(r+1)$th member falls after the $r$th member 
when the ranking is arrayed against another ranking in the natural order 1, 2, 3, ..., and 
reduce it by unity if the other member of the pair becomes the $(r+1)$th; and these two 
possibilities complete the ways of forming a ranking with $p_2$ pairs from one with $p_2 + 1$ pairs. 

This simple relationship, which may be written 

$$f(S_j, n, p_2) = f(S_j + 1, n, p_2 + 1) + f(S_j - 1, n, p_2 + 1),$$

or, taking another way of writing the basic distribution function, 

$$\phi(S_j, p_1, p_2) = \phi(S_j + 1, p_1 - 2, p_2 + 1) + \phi(S_j - 1, p_1 - 2, p_2 + 1), \tag{1}$$

$p_1$ being the number of members not in tied pairs, is of great assistance in tabulating the 
frequency distribution, and will be used below to establish the formula for the variance of $S$. 
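It can be generalized to cover the effect of increasing the number of $r$-tuplets, when it becomes 

$$\begin{aligned}
\phi(S_j, p_1, p_2, \ldots, p_{r-1}, p_r) ={}& \phi(S_j - r + 1,\ p_1 - 1, p_2, \ldots, p_{r-1} - 1, p_r + 1) \\
&+ \phi(S_j - r + 3,\ p_1 - 1, p_2, \ldots, p_{r-1} - 1, p_r + 1) \\
&+ \ldots \\
&+ \phi(S_j + r - 1,\ p_1 - 1, p_2, \ldots, p_{r-1} - 1, p_r + 1).
\end{aligned} \tag{2}$$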
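
The recurrence relation for tied pairs, $f(S, n, p_2) = f(S+1, n, p_2+1) + f(S-1, n, p_2+1)$, may be verified by enumeration for small rankings. The sketch below is an illustration only; the function names are hypothetical, and the basic distributions are obtained by listing every distinct arrangement rather than by the tabular method of the text.

```python
from collections import Counter
from itertools import permutations


def basic_distribution(ranks):
    """Frequency of each score over all distinct arrangements of the given ranks."""
    def score(seq):
        return sum((seq[j] > seq[i]) - (seq[j] < seq[i])
                   for i in range(len(seq)) for j in range(i + 1, len(seq)))
    return Counter(score(p) for p in set(permutations(ranks)))


def satisfies_pair_recurrence(ranks_p2, ranks_p2_plus_1):
    """True if f(S) for p2 pairs equals g(S+1) + g(S-1) for p2 + 1 pairs, for every S."""
    f = basic_distribution(ranks_p2)
    g = basic_distribution(ranks_p2_plus_1)
    # A Counter returns 0 for scores that do not occur.
    return all(f[s] == g[s + 1] + g[s - 1]
               for s in range(min(f) - 1, max(f) + 2))


no_pair_vs_one = satisfies_pair_recurrence([1, 2, 3, 4, 5], [1, 1, 3, 4, 5])
one_pair_vs_two = satisfies_pair_recurrence([1, 1, 3, 4, 5], [1, 1, 3, 3, 5])
```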
Frequency and probability distributions of the score S 
From a table of basic frequency distributions such as Table 1, a table showing the 
probability of attaining or exceeding an observed value of $S$ by chance from an 
uncorrelated pair of rankings can obviously be constructed, and Table 2 shows such 
probabilities (positive $S$ only, negative values obtainable by symmetry) for values of $n$ up 
to 10, and all possible numbers of paired or triplet ties. 


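The variance of S 

The variance of $S$ when ties are present can be readily derived by using the recurrence 
relations given above and the value given by Kendall for the case of no ties. For the case 
of tied pairs consider 

$$\begin{aligned}
(S+1)^2\phi(S+1, p_1-2, p_2+1) &+ (S-1)^2\phi(S-1, p_1-2, p_2+1) \\
={}& S^2\{\phi(S+1, p_1-2, p_2+1) + \phi(S-1, p_1-2, p_2+1)\} \\
&+ 2\{(S+1)\phi(S+1, p_1-2, p_2+1) - (S-1)\phi(S-1, p_1-2, p_2+1)\} \\
&- \{\phi(S+1, p_1-2, p_2+1) + \phi(S-1, p_1-2, p_2+1)\}.
\end{aligned}$$

If now both sides of this equation are summed over all values of $S$, the terms on the 
left-hand side become 

$$\frac{n!}{2^{p_2}}\operatorname{var}\phi(S, p_1-2, p_2+1).$$

The first terms on the right-hand side become, by virtue of the recurrence relation (1), 

$$\frac{n!}{2^{p_2}}\operatorname{var}\phi(S, p_1, p_2);$$

the second vanishes through the symmetry of the distribution, while the sum in the third becomes 

$$\frac{2\,n!}{2^{p_2+1}}.$$

Hence there is obtained 

$$\operatorname{var}\phi(S, p_1-2, p_2+1) = \operatorname{var}\phi(S, p_1, p_2) - 1,$$

and so 

$$\operatorname{var}\phi(S, n-2p_2, p_2) = \operatorname{var}\phi(S, n) - p_2 = \frac{n(n-1)(2n+5)}{18} - p_2,$$

using Kendall’s result. 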

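These results can also be generalized, using equation (2), to deal with multiplet ties, 
obtaining 

$$\operatorname{var}\phi(S, p_1 - 1, p_2, \ldots, p_{r-1} - 1, p_r + 1) = \operatorname{var}\phi(S, p_1, p_2, \ldots, p_{r-1}, p_r) - \frac{r^2 - 1}{3},$$

and 

$$\operatorname{var}\phi(S, p_1, p_2, \ldots, p_r) = \frac{n(n-1)(2n+5)}{18} - p_2 - \frac{3+8}{3}\,p_3 - \ldots - \frac{3 + 8 + \ldots + (r^2-1)}{3}\,p_r.$$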


It is obvious from these equations for multiplet ties that for any given number of ties of 
each multiplicity the variance will tend towards that of the system without any ties as n 
increases. 
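
The variance formula for multiplet ties may be compared with the variance found by direct enumeration of a small ranking. The following sketch is an illustration only, with hypothetical function names; exact fractions are used so that the comparison is not affected by rounding, and each $r$-tuplet is taken to reduce the variance by $\{3 + 8 + \ldots + (r^2 - 1)\}/3$ as in the text.

```python
from collections import Counter
from fractions import Fraction
from itertools import permutations


def score(seq):
    """Score of an arrangement against the natural order 1, 2, 3, ..."""
    n = len(seq)
    return sum((seq[j] > seq[i]) - (seq[j] < seq[i])
               for i in range(n) for j in range(i + 1, n))


def variance_by_enumeration(ranks):
    """Variance of S over every distinct arrangement of the ranks."""
    scores = [score(p) for p in set(permutations(ranks))]
    mean = Fraction(sum(scores), len(scores))
    return Fraction(sum(s * s for s in scores), len(scores)) - mean ** 2


def variance_by_formula(ranks):
    """n(n-1)(2n+5)/18 less {3 + 8 + ... + (r^2 - 1)}/3 for each r-tuplet."""
    n = len(ranks)
    var = Fraction(n * (n - 1) * (2 * n + 5), 18)
    for r in Counter(ranks).values():
        var -= Fraction(sum(k * k - 1 for k in range(2, r + 1)), 3)
    return var


one_pair = [1, 1, 3, 4, 5]
triplet_and_pair = [1, 1, 1, 4, 5, 5]
```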


Application to a practical case 

The following results were obtained in a practical case in which two different tests were 
carried out on one each of a set of products. The problem is to determine the degree of 
relationship between the results of the two tests. It is also an instance of an occurrence 
which arises at times in practice, in which some of the results are ‘off the scale’ of 
measurement and are given a tied ranking of 18. 


T.'-nfc A 

4()'H(I 

41-7(l 

3(1'75 

:)7s55 

29-40 

25-20 

20-75 

28-45 

20-,85 

20-35 

Test B 

1‘5 

1-r. 

I'5 

2-5 

3-5 

10 

2-5 

2-5 

2-5 

(i-f) 

Teat A 

21'4() 

ItMiO 

IH‘95 

22’90 

22‘Hll 

20-2.5 

24-45 

22-7(1 

20-,50 


Test B 

>1(1 

>10 

>10 

>10 

>10 

>10 

> 10 

>10 

> 10 


Test A 

22'()0 

27-,50 

2:t7ri 

30-S0 

2M10 

27-10 

22-10 

19*25 

25-45 

24-10 

'Pest B 

>10 

:t5 

1'5 

2-5 

7 

0-5 

7-5 

9 

3-5 

>10 


Ranking the results according to their order in test A (from highest values to lowest) there 
is obtained 
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4 

,5 
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13 
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5 
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10 
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21 
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20 
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28 
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18 

IS 

IS 

Hi 

IS 

IS 

15 

IS 

IS 

17 
The lower ranking has 1 pair, 1 triplet, 1 quadruplet, 1 quintuplet and one 12-member 
multiplet. The maximum possible score is 

$$\tfrac{1}{2}(29 \cdot 28) - 1 - 3 - 6 - 10 - 66 = 320$$

in such a ranking. The actual score is +212, giving $\tau = 212/320 = 0.6625$. The variance of 
the distribution of the scores obtained with such rankings in the case of no correlation is 

$$\frac{29 \cdot 28 \cdot 63}{18} - 1 - \frac{11}{3} - \frac{26}{3} - \frac{50}{3} - \frac{638}{3} = 2599.33.$$

Hence the probability of obtaining a score of 212 or more from an uncorrelated pair of 
such rankings corresponds to the probability of a normal variate attaining or exceeding 

$$\frac{212}{\sqrt{(2599.33)}}$$

times its standard deviation. 
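
The arithmetic of this example may be reproduced as follows. The sketch is an illustration only: the variable names are hypothetical, the tie structure (one pair, one triplet, one quadruplet, one quintuplet and one 12-member multiplet among 29 results) and the score of 212 are taken from the text, and the normal tail area is computed without any correction for continuity, which is an assumption.

```python
import math

n = 29
tie_sizes = [2, 3, 4, 5, 12]          # pair, triplet, quadruplet, quintuplet, 12-member multiplet
score = 212

# Each r-tuplet reduces the maximum possible score by r(r-1)/2.
max_score = n * (n - 1) // 2 - sum(r * (r - 1) // 2 for r in tie_sizes)
tau = score / max_score

# Each r-tuplet reduces the variance by {3 + 8 + ... + (r^2 - 1)}/3.
variance = n * (n - 1) * (2 * n + 5) / 18 - sum(
    sum(k * k - 1 for k in range(2, r + 1)) / 3 for r in tie_sizes)

deviate = score / math.sqrt(variance)                 # score in units of its standard deviation
upper_tail = 0.5 * math.erfc(deviate / math.sqrt(2))  # normal probability of exceeding it
```

On these figures the score lies somewhat more than four standard deviations from zero, so that the chance of its arising from uncorrelated rankings is very small.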
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THE USE OF RANGE IN PLACE OF STANDARD 
DEVIATION IN THE t-TEST 

By E. LORD, B.Sc., Shirley Institute, Didsbury, Manchester 
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(i) Introduction 

The difference between highest and lowest values has always been recognized as a general 
indication of the variability of quantitative data. It was not, however, until 1925 that 
attention began to be focused upon the range as a useful statistical tool. In his paper ‘On 
the extreme individuals and the range in samples taken from a normal population’, Tippett 
(1925) obtained an expression for the mean value of the range in repeated random samples, 
and computed its value in terms of the population standard deviation for samples of size 
n = 2 to n = 1000. He also gave numerical approximations to the values of the moments 
of the range for fairly large samples. 

The work was taken up by E. S. Pearson (1926), who determined numerically the exact 
values of the moments of the random sampling distribution of the range for small samples 
of size n < 6, and also approximations to their values for samples of medium size. In a 
subsequent paper E. S. Pearson (1932) tabulated the upper and lower percentage limits for 
the distribution of the range from frequency curves fitted with the values of the moment 
coefficients taken from both of the earlier papers cited above. 

The next advance was the determination of a general expression for the distribution of 
the range in samples of n random values from any population by McKay & Pearson (1933). 
For the normal population, only in the case of n = 3 was it found possible to obtain a fairly 
simple analytical form. (The distribution for n = 2 is, of course, well known, taking the 
form of the positive half of a normal curve.) 

Hartley (1942) later determined an expression for the probability integral of the range and, 
with Pearson (1942), tabulated this for the normal population for samples between n = 2 and 
n = 20. This latter paper also contains a table of several percentage limits of the range 
in samples from a normal population. These limits are derived from the numerical values of 
the probability integrals and replace the approximate values previously given by Pearson 
referred to above. 

Tippett (1925) and Pearson (1932) have pointed out that although the total range in a 
sample may be used for the purpose of estimating the population value of the standard 
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deviation, a more efficient measure may be obtained by dividing the sample into random 
subgroups of equal size and using the mean range of the subgroups in place of the total range. 
The efficiency of range estimates of standard deviation is, of course, always less than that of 
root-mean-square estimates, but the work of Davies & Pearson (1934) and Pearson & Haines 
(1936) indicates that information is not discarded to any serious extent providing that the 
number of observations in the subsamples is not greatly in excess of about 10. 

As a result of the work outlined above, the range is now of considerable importance in 
many fields, especially in industrial quality control, where its simplicity has enabled it to 
be extensively and easily applied to the measurement of fluctuations in the variability of 
quality of a manufactured article or material. 

In the present paper an investigation is made of the use of range estimates of standard 
deviation in the consideration of the statistical significance of deviations of sample means in 
normal random sampling theory. This use of range estimates of standard deviation is 
analogous to the use of root-mean-square estimates in the well-known t-test. Tables are 
given, at several probability levels, and these may be employed in determining the statistical 
significance of either the deviation of a sample mean from some fixed or hypothetical popula¬ 
tion value, or the difference between the means of two samples. These tables may also be 
used for obtaining rapid estimates of the accuracy of a sample mean from the variation 
within the sample as measured by the range. The use of range, in place of root-mean-square 
estimates of standard deviation, in this modified form of the t-test necessarily entails some 
loss of precision. It will, however, be shown in a future paper that this reduction in accuracy 
is negligible for all practical purposes. Furthermore, this slight disadvantage of the new test 
is compensated by its greater simplicity, involving a reduced amount of computing compared 
with the usual t-test. 

The range test is suitable for application to many problems frequently encountered in the 
treatment of various types of experimental data and in considering the mean character value 
in small samples in biological experiments. In the industrial field, the range test may be 
used for detecting changes in mean quality level, especially where the variation is not 
under strict statistical control or is subject to secular changes, or for determining whether 
the average level of a batch determined from a sample is in accordance with specification 
demands. A number of these problems are covered in the examples given at the end of the 
paper. 


(ii) The t-test* 

In testing the significance of the deviation of a sample mean $\bar{x}$ from an assumed population 
value $\mu$, use is made of the ratio 

$$t = \frac{|\bar{x} - \mu|\,\sqrt{N}}{s}, \tag{1}$$

where $N$ is the size of the sample and $s$ is the root-mean-square estimate of the population 
standard deviation determined from the sample. In applying this ratio it is assumed that 
the $N$ values form a random sample from a normal population of which the mean is $\mu$, the 
standard deviation $\sigma$, and the distribution of values of $x$ is given by 

$$p(x) = \frac{1}{\sigma\sqrt{(2\pi)}}\,e^{-(x-\mu)^2/2\sigma^2}. \tag{2}$$

* ‘Student’ (1908), R. A. Fisher (1925). 
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More generally t may be defined as the ratio 

$$t = x/s, \tag{3}$$

where $x$ and $s$ are statistically independent, $x$ being a quantity distributed normally about 
a mean of zero and $s$ a root-mean-square estimate, based on $\nu$ degrees of freedom, of the 
standard error of $x$. Although the use of the tables of the probability integral of $t$ enables 
the most efficient tests to be made of the various forms of the so-called ‘Student’s 
Hypothesis’, occasions frequently arise when more rapid tests are desirable, especially if 
accompanied by only inappreciable loss of accuracy. The calculation of $s$, depending upon the 
squaring of numerical quantities, entails a certain amount of labour, especially if tables of 
squares or a calculating machine are not available. The use of the range, or the mean range 
determined from random subgroups in a sample, enables very rapid estimates to be made 
of the population value of the standard deviation $\sigma$. In the following section these range 
estimates are used in place of root-mean-square estimates in a modified form of the t-test. 


(iii) The modified test (u-test) based on range 
Here we replace the $s$ of ‘Student’s’ ratio by an estimate of $\sigma$ based on the range. Thus 

$$u = u(m, n) = \frac{x\,d_n}{\bar{w}(m, n)}, \tag{4}$$

where $x$ is a quantity distributed normally about a mean of zero and $\bar{w}(m, n)$ is the mean 
value of $m$ ranges $w$, obtained from $m$ independent samples or subgroups, each containing 
$n$ observations. The constant $d_n$, in a commonly used notation,* is the expected value of the 
range in samples of $n$, randomly selected from a normal population of unit standard 
deviation. The ratio $\bar{w}(m, n)/d_n$ is therefore an estimate of the standard error of $x$ obtained from 
range and, as such, replaces the root-mean-square estimate $s$ used in the ratio $t = x/s$. 
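
The computation of the ratio $u = x\,d_n/\bar{w}$ for a single sample may be sketched as follows. This is an illustration only: the function names and the numerical example are hypothetical, the values of $d_n$ are the commonly tabulated expected ranges rounded to three decimals, and $x$ is taken as $(\bar{x} - \mu)\sqrt{N}$ so that its standard error is the population standard deviation.

```python
import math

# Expected range d_n in samples of n from a normal population of unit standard
# deviation (commonly tabulated values, rounded to three decimals).
D_N = {2: 1.128, 3: 1.693, 4: 2.059, 5: 2.326, 6: 2.534,
       7: 2.704, 8: 2.847, 9: 2.970, 10: 3.078}


def u_statistic(x, ranges, n):
    """u = x d_n / w_bar, where w_bar is the mean of the m subgroup ranges."""
    w_bar = sum(ranges) / len(ranges)
    return x * D_N[n] / w_bar


def u_for_sample_mean(sample, hypothetical_mean):
    """Case m = 1: one sample, its own range, deviation of its mean from a fixed value."""
    n = len(sample)
    x = (sum(sample) / n - hypothetical_mean) * math.sqrt(n)
    return u_statistic(x, [max(sample) - min(sample)], n)


u = u_for_sample_mean([10.2, 9.8, 10.5, 10.1, 9.9], 10.0)
```

The value obtained would then be referred to the tabulated percentage points of $u(1, 5)$.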

Except for a few special cases, it has not been found possible to determine the analytical 
form of the distribution of $u$, but several tables of percentage points have been computed 
for use in testing the various statistical hypotheses normally covered by the t-test. The 
computation of these tables is considerably simplified by first determining the percentage 
points of the distribution of the subsidiary quantity 

$$q = q(m, n) = \frac{u(m, n)}{d_n} = \frac{x}{\bar{w}(m, n)}, \tag{5}$$

and then multiplying by the corresponding value of $d_n$ to obtain the percentage points of the 
$u$ distribution. 

To simplify the algebraic expressions in what follows, $u$, $\bar{w}$ and $q$ will be written for $u(m, n)$, 
$\bar{w}(m, n)$ and $q(m, n)$ where no confusion is involved. 

The distributions of both $u$ and $q$ are clearly independent of $\sigma$. Hence, without any loss 
of generality, $\sigma$ may be taken equal to unity in considering the distributions. The distribution 
of $x$ will therefore be defined by 

$$p(x) = \frac{1}{\sqrt{(2\pi)}}\,e^{-\frac{1}{2}x^2}. \tag{6}$$

* See, for example, Pearson (1935), pp. 84 and 90. 

Furthermore, let the distribution of the range $w$ in a sample of $n$ be $y = p(w)$, that of $\bar{w}$ 
be $y = p(\bar{w})$, and that of $q$ be $y = p(q)$. Then since $x$ and $\bar{w}$ are defined to be statistically 
independent, we have the distribution of $q$ given by 

$$p(q) = \iint p(\bar{w})\,p(x)\,d\bar{w}\,dx, \tag{7}$$

where the integral is to be evaluated over the field of all values of $x$ and $\bar{w}$ subject to the 
relation (5) and to the conditions: 

$$-\infty < x < \infty, \qquad 0 \le \bar{w} < \infty. \tag{8}$$

Since $x$ is distributed symmetrically about zero, and $\bar{w}$ is independent of $x$, the ratio $q$ is 
also symmetrically distributed about zero. Let $q_\alpha$ be the value of $q$ such that $\alpha$ is the chance 
that $|q| \ge q_\alpha$. The quantity $\alpha$ represents the total area of the two equal tails of the 
distribution lying outside deviations $\pm q_\alpha$, and we have 

$$(1 - \alpha) = 2\int_0^{q_\alpha} p(q)\,dq. \tag{9}$$

Alternatively, from (6), (7) and (8), this may be written in the form 

$$(1 - \alpha) = 2\int_0^\infty p(\bar{w})\left\{\int_0^{\bar{w}q_\alpha} p(x)\,dx\right\}d\bar{w} = 2\int_0^\infty p(\bar{w})\left\{\int_0^{\bar{w}q_\alpha}\frac{1}{\sqrt{(2\pi)}}\,e^{-\frac{1}{2}x^2}\,dx\right\}d\bar{w}. \tag{10}$$

Except for a few cases in which the analytical form of the distribution of $u$ has been 
obtained, equation (10) has been used to compute values of $q_\alpha$ and, hence, the percentage 
points of $u$ for values of $\alpha$ = 0.10, 0.05, 0.02, 0.01, 0.002 and 0.001, with values of $n$ from 
2 to 20 and values of $m$ from 1 to 10, 16, 20, 30, 60 and 120. 

The percentage points of $u = u(m, n)$ are first considered for the case when the estimate of 
standard deviation is based on the value of a single range of $n$ random values (i.e. for $m = 1$). 
This treatment is followed by the case of $m = 2$, and finally consideration is given to the 
general case using estimates of standard deviation determined from the mean of $m$ ranges 
each from an independent subgroup of $n$ random values. 


(iv) Computation of percentage points of the distribution of 
u = u(1, n), i.e. case with m = 1 

Throughout this section the estimates of standard deviation are all based upon the value of 
the range in a single set of $n$ random values of the variate (thus $\bar{w} = w$). In the case of $n = 2$ 
and $n = 3$ analytical solutions are derived for the distributions from which the percentage 
points of $q$ and $u$ are calculated. For $n \ge 4$, percentage points in the neighbourhood of those 
desired are determined by quadrature methods, and the required points obtained from these 
by interpolation. 

Special case n = 2, m = 1 

The distribution of ranges ($w$) in samples of two random values from a normal population 
with unit standard deviation is the distribution of absolute differences between random pairs 
of variate values, and this may be easily shown to be 

$$p(w)\,dw = \frac{1}{\sqrt{\pi}}\,e^{-\frac{1}{4}w^2}\,dw \qquad (0 \le w < \infty). \tag{11}$$

Since $x$ and $w$ are independent, it follows from (6) and (11) that their joint probability 
distribution is 

$$p(w, x)\,dw\,dx = \frac{1}{\pi\sqrt{2}}\,e^{-\frac{1}{4}w^2 - \frac{1}{2}x^2}\,dw\,dx. \tag{12}$$

Transforming to new variables, $q = x/w$ and $w$, and noting that the Jacobian of the 
transformation is 

$$\frac{\partial(x, w)}{\partial(q, w)} = w,$$

the joint distribution of $q$ and $w$ is given by 

$$p(q, w)\,dq\,dw = \frac{1}{\pi\sqrt{2}}\,w\,e^{-\frac{1}{4}w^2(1 + 2q^2)}\,dq\,dw. \tag{13}$$

To obtain the distribution of $q$ it is necessary to integrate (13) over the whole field of $w$, 
from 0 to $\infty$. This gives 

$$p(q) = \frac{\sqrt{2}}{\pi(1 + 2q^2)}. \tag{14}$$

Hence from (9) and (14) above, the percentage points of $q$ are given by 

$$(1 - \alpha) = \frac{2\sqrt{2}}{\pi}\int_0^{q_\alpha}\frac{dq}{1 + 2q^2} = \frac{2}{\pi}\tan^{-1}(\sqrt{2}\,q_\alpha),$$

and hence 

$$q_\alpha(1, 2) = \frac{1}{\sqrt{2}}\tan\{\tfrac{1}{2}\pi(1 - \alpha)\} = \frac{1}{\sqrt{2}}\cot(\tfrac{1}{2}\pi\alpha). \tag{15}$$

The values of $q_\alpha$ determined from (15), for the six values of $\alpha$ under consideration, are 
multiplied by $d_2 = 2/\sqrt{\pi}$ to give the required percentage points of the distribution of 
$u = u(1, 2)$. 
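
The closed form for $n = 2$ may be evaluated directly. The following sketch is an illustration only, with a hypothetical function name; it takes $q_\alpha(1, 2) = \cot(\tfrac{1}{2}\pi\alpha)/\sqrt{2}$ and $d_2 = 2/\sqrt{\pi}$ and multiplies them to give the percentage point of $u(1, 2)$.

```python
import math


def u_point_n2(alpha):
    """Percentage point of u(1, 2): d_2 times q_alpha(1, 2)."""
    q_alpha = 1.0 / (math.sqrt(2.0) * math.tan(0.5 * math.pi * alpha))  # cot(pi alpha / 2) / sqrt(2)
    d_2 = 2.0 / math.sqrt(math.pi)
    return d_2 * q_alpha


points = {alpha: u_point_n2(alpha) for alpha in (0.10, 0.05, 0.02, 0.01, 0.002, 0.001)}
```

For $\alpha$ = 0.10 and 0.05 this gives values close to 5.0376 and 10.1381 respectively.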

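Special case n = 3, m = 1 

For random samples of size $n = 3$ from a normal distribution with unit standard deviation, 
the distribution of the range has been found by McKay & Pearson (1933) and takes the form 

$$p(w)\,dw = \frac{3\sqrt{2}}{\pi}\,e^{-\frac{1}{4}w^2}\,dw\int_0^{w/\sqrt{6}} e^{-\frac{1}{2}z^2}\,dz.$$

Again, since $w$ and $x$ are independent, their joint distribution is given by 

$$p(w, x)\,dw\,dx = \frac{3}{\pi\sqrt{\pi}}\,e^{-\frac{1}{4}w^2 - \frac{1}{2}x^2}\,dw\,dx\int_0^{w/\sqrt{6}} e^{-\frac{1}{2}z^2}\,dz. \tag{16}$$

Transforming to new variables $q = x/w$ and $w$, it follows from (16), since the Jacobian of 
the transformation is equal to $w$, that the joint distribution of $q$ and $w$ is given by 

$$p(q, w)\,dq\,dw = \frac{3}{\pi\sqrt{\pi}}\,e^{-\frac{1}{2}w^2(q^2 + \frac{1}{2})}\,w\,dw\,dq\int_0^{w/\sqrt{6}} e^{-\frac{1}{2}z^2}\,dz. \tag{17}$$

To obtain the distribution of $q$ the expression in (17) has to be integrated over the whole 
field of $w$, from 0 to $\infty$. Thus 

$$p(q) = \frac{3}{\pi\sqrt{\pi}}\int_0^\infty w\,e^{-\frac{1}{2}w^2(q^2 + \frac{1}{2})}\,dw\int_0^{w/\sqrt{6}} e^{-\frac{1}{2}z^2}\,dz.$$

Putting $\zeta = z\sqrt{6}$, the above may be written 

$$\begin{aligned}
p(q) &= \frac{3}{\pi\sqrt{(6\pi)}}\int_0^\infty\!\!\int_0^w w\,e^{-\frac{1}{2}w^2(q^2 + \frac{1}{2})}\,e^{-\frac{1}{12}\zeta^2}\,d\zeta\,dw \\
&= \frac{-3}{\pi\sqrt{(6\pi)}}\left[\frac{e^{-\frac{1}{2}w^2(q^2 + \frac{1}{2})}}{(q^2 + \frac{1}{2})}\int_0^w e^{-\frac{1}{12}\zeta^2}\,d\zeta\right]_0^\infty + \frac{3}{\pi\sqrt{(6\pi)}\,(q^2 + \frac{1}{2})}\int_0^\infty e^{-\frac{1}{2}w^2(q^2 + \frac{2}{3})}\,dw.
\end{aligned} \tag{18}$$

The first expression in (18) is clearly zero and hence 

$$p(q) = \frac{3}{\pi\sqrt{(6\pi)}\,(q^2 + \frac{1}{2})}\int_0^\infty e^{-\frac{1}{2}w^2(q^2 + \frac{2}{3})}\,dw = \frac{\sqrt{3}}{2\pi(q^2 + \frac{1}{2})(q^2 + \frac{2}{3})^{\frac{1}{2}}}. \tag{19}$$

As before 

$$(1 - \alpha) = 2\int_0^{q_\alpha} p(q)\,dq,$$

and therefore 

$$(1 - \alpha) = \frac{\sqrt{3}}{\pi}\int_0^{q_\alpha}\frac{dq}{(q^2 + \frac{1}{2})(q^2 + \frac{2}{3})^{\frac{1}{2}}} = \frac{6}{\pi}\tan^{-1}\left\{\frac{q_\alpha}{(2 + 3q_\alpha^2)^{\frac{1}{2}}}\right\}.$$

If $\tan\{\tfrac{1}{6}\pi(1 - \alpha)\}$ be denoted by $\tau$, then 

$$q_\alpha(1, 3) = \frac{\sqrt{2}\,\tau}{(1 - 3\tau^2)^{\frac{1}{2}}}. \tag{20}$$

The six required values of $q_\alpha$ are found by substitution of the corresponding values of $\alpha$ in 
(20) above, and further multiplication by $d_3 = 3/\sqrt{\pi}$ gives the percentage limits of the 
distribution of $u = u(1, 3)$. 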
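
The closed form for $n = 3$ may be evaluated in the same manner. The sketch below is an illustration only, with a hypothetical function name; it takes $\tau = \tan\{\tfrac{1}{6}\pi(1 - \alpha)\}$, $q_\alpha(1, 3) = \sqrt{2}\,\tau/(1 - 3\tau^2)^{\frac{1}{2}}$ and $d_3 = 3/\sqrt{\pi}$.

```python
import math


def u_point_n3(alpha):
    """Percentage point of u(1, 3): d_3 times q_alpha(1, 3)."""
    tau = math.tan(math.pi * (1.0 - alpha) / 6.0)
    q_alpha = math.sqrt(2.0) * tau / math.sqrt(1.0 - 3.0 * tau * tau)
    d_3 = 3.0 / math.sqrt(math.pi)
    return d_3 * q_alpha


points_n3 = {alpha: u_point_n3(alpha) for alpha in (0.10, 0.05, 0.02, 0.01, 0.002, 0.001)}
```

For $\alpha$ = 0.10 and 0.05 this gives values close to 2.5935 and 3.8225 respectively. As $\alpha$ becomes small the quantity $1 - 3\tau^2$ approaches zero, so that the smaller percentage points are sensitive to rounding in $\tau$.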


General case n ≥ 4, m = 1 

For $n \ge 4$ no suitable algebraic expression exists for the distribution of the range, but 
Pearson & Hartley (1942) have tabulated values of the probability integral $\int_0^w p(w)\,dw$ to 
4 figures at intervals of 0.05 of $w$ for values of $n$ from 2 to 20. Hartley kindly lent manuscript 
tables of the integral tabulated to 5 figures at intervals of 0.25 of $w$ for values of $n$ from 
2 to 16. Using these five-figure tables for $n$ = 4, 6, 10, 16 and the four-figure tables for $n$ = 20, 
the frequency distribution of $w$ was obtained numerically by subtraction of successive values 
of $\int_0^w p(w)\,dw$ at intervals of 0.25 and then converting these class frequencies into ordinates 
$y(w)$. The degree of approximation in the formula used implied the vanishing of fifth 
differences (see K. Pearson, Tables for Statisticians and Biometricians, Part II, p. xvii). 
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Each case was treated in turn, and the six values of the percentage points corresponding 
to the six different values of a under consideration, were determined using the relations 
given in (10) above. Taking a trial value of $q_\alpha$, the integrals 

$$I(w, q_\alpha) = \int_0^{w q_\alpha} p(x)\,dx$$

were calculated at intervals of 0.25 over the whole range of $w$. Quadrature was then applied 
to the products $y(w)\,I(w, q_\alpha)$, over the range $0 \le w < \infty$, to obtain the value of $(1 - \alpha)$ 
corresponding to the trial value of $q_\alpha$. This procedure was repeated a number of times to obtain 
values of $(1 - \alpha)$ corresponding to a series of equidistant values of $q_\alpha$. The required values of 
$q_\alpha$ corresponding to the six values of $\alpha$ under investigation were then obtained by backward 
interpolation. 

As $n \to \infty$ the ratio $w/d_n$ tends to the population value of the standard deviation. 
Furthermore, for $n = 2$ and $n = 3$, exact values of $q_\alpha$ had been previously obtained by direct 
calculation for the six values of $\alpha$. Thus it was possible to make initial estimates of the required 
values of $q_\alpha$ and the process of this ‘trial and error’ method was not found too laborious. 
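
The procedure for $n \ge 4$ may be imitated numerically. The sketch below is an illustration only and is not the method of computation used for the tables: the function names, step lengths and limits of integration are assumptions, the ordinates of the range distribution are obtained from the standard integral $p(w) = n(n-1)\int \phi(x)\,\phi(x+w)\{\Phi(x+w) - \Phi(x)\}^{n-2}\,dx$ instead of from tables of the probability integral, and bisection replaces trial values with backward interpolation. The quadrature of relation (10) is otherwise as described.

```python
import math

SQRT2 = math.sqrt(2.0)
SQRT2PI = math.sqrt(2.0 * math.pi)


def phi(x):
    """Ordinate of the unit normal curve."""
    return math.exp(-0.5 * x * x) / SQRT2PI


def Phi(x):
    """Probability integral of the unit normal curve."""
    return 0.5 * (1.0 + math.erf(x / SQRT2))


def simpson(values, h):
    """Composite Simpson rule on equally spaced ordinates (odd number of points)."""
    assert len(values) % 2 == 1
    return (values[0] + values[-1] + 4.0 * sum(values[1:-1:2])
            + 2.0 * sum(values[2:-1:2])) * h / 3.0


def range_density(w, n, h=0.05, lim=8.0):
    """Ordinate p(w) of the range in samples of n from a unit normal population."""
    steps = int(round(2.0 * lim / h))
    vals = []
    for i in range(steps + 1):
        x = -lim + i * h
        vals.append(n * (n - 1) * phi(x) * phi(x + w) * (Phi(x + w) - Phi(x)) ** (n - 2))
    return simpson(vals, h)


def u_point(n, alpha, h=0.02, w_max=12.0):
    """Percentage point of u(1, n) from relation (10), solved for q_alpha by bisection."""
    steps = int(round(w_max / h))
    ws = [i * h for i in range(steps + 1)]
    dens = [range_density(w, n) for w in ws]
    d_n = simpson([w * p for w, p in zip(ws, dens)], h)      # mean range

    def one_minus_alpha(q):
        # 2 * integral of p(w) * {Phi(w q) - 1/2} over w
        return simpson([p * (2.0 * Phi(w * q) - 1.0) for w, p in zip(ws, dens)], h)

    lo, hi = 0.0, 100.0
    for _ in range(60):
        mid = 0.5 * (lo + hi)
        if one_minus_alpha(mid) < 1.0 - alpha:
            lo = mid
        else:
            hi = mid
    return d_n * 0.5 * (lo + hi)
```

Applied to $n = 3$ the routine should agree with the exact value from (20), which provides a check on the quadrature before it is used for larger samples.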


Table 1. Framework values of percentage points of u = u(1, n) 

  n        α = 0.10    0.05       0.02     0.01     0.002    0.001
  2        5.0376      10.1381    25.389   50.791   253.97   507.95-
  3        2.5935-     3.8225+    6.188    8.819    19.84    28.08
  4        2.1793      2.9505+    4.213    5.420    9.42     11.75+
  6        1.9354      2.4765+    3.249    3.900    5.71     6.66
  10       ...         ...        2.807    3.244    4.32     4.82
  16       1.7496      2.1385+    2.628    2.990    3.82     4.19
  20 (a)   ...         ...        2.576    2.916    3.69     ...
  20 (b)   1.7314      ...        2.576    2.916    3.69     ...
  ∞        1.6449      ...        2.326    2.576    3.09     3.29


The framework values of the percentage points of $u(1, n)$ were obtained by multiplying 
the values of $q_\alpha$ by the corresponding values of the mean range $d_n$ tabulated by Tippett 
(1925) and are given in Table 1, together with the exact values for $n = 2$ and $n = 3$ 
determined above. 

As a check on the accuracy of determination of these percentage points, the six values 
for $n = 20$ were also calculated by a second method. Writing 

$$r = 1/q = w/x, \tag{22}$$

the method is to determine, for the six values of $\alpha$, the corresponding values of $q_\alpha$, such that 

$$\alpha = \int_{-1/q_\alpha}^{0} p(r)\,dr + \int_{0}^{1/q_\alpha} p(r)\,dr, \tag{23}$$

where $y = p(r)$ denotes the frequency distribution of $r$. Since $q$ is distributed symmetrically 
about zero, its reciprocal $r$ is also distributed symmetrically about zero, and from (22) 
and (23) it follows that 

$$\alpha = 2\int_0^{1/q_\alpha} p(r)\,dr = 2\int_0^\infty p(x)\,dx\int_0^{x/q_\alpha} p(w)\,dw. \tag{24}$$

Ordinates of the normal curve $y(x) = \frac{1}{\sqrt{(2\pi)}}\,e^{-\frac{1}{2}x^2}$ were taken at intervals of 0.25 for $x$ in 
the range $0 \le x < \infty$ from K. Pearson’s Tables for Statisticians and Biometricians, Part I. 
Taking a trial value of $1/q_\alpha$, the integrals $\int_0^{x/q_\alpha} p(w)\,dw$ were calculated from Hartley’s 
four-figure tables for each value of $x$ in the above range. By quadrature applied to the products 
$p(x)\int_0^{x/q_\alpha} p(w)\,dw$ the integral (24) was evaluated. By taking a series of equidistant values of 
$1/q_\alpha$ other trial values of $\alpha$ were determined. Backward interpolation was then used to 
obtain the required values of $q_\alpha$ corresponding to the six values of $\alpha$ under consideration. 
Finally, the six percentage points of $u$ were determined by multiplying the values of $q_\alpha$ by 
$d_{20}$ given in Tippett’s table. These percentage points are given in the penultimate line (b) 
of Table 1, and comparison with the corresponding values in the line above, (a), indicates 
good agreement between the two methods of computation. 

Since the percentage points of $u$ for $n$ = 4, 6, 10 and 16 have been determined using 
Hartley’s five-figure manuscript tables of the cumulative frequency distribution of $w$, they 
should be at least as accurate as the percentage points for $n = 20$ determined from the 
four-figure tables. 

Changes in the percentage points at the different levels of significance run most smoothly 
if arguments proportional to $1/n$ are used in place of $n$, and reciprocals of $u$ for the variate. 
Using a six-point general Lagrangian formula applied to the points corresponding to 
$n$ = 3, 4, 6, 10, 16 and 20, values of percentage points of $u$ were determined for $n$ = 8, 12, 14 
and 18. (In the case of $n = 20$ the mean values of the percentage points determined by the 
two methods were used.) The interval was then halved, using a nine-point Lagrangian
through points corresponding to n = 4, 6, 8, 10, 12, 14, 16, 18 and 20. Finally, the six sets
of percentage points were differenced as a check, reduced by either one or two figures and, with
the exception of those for n = 5, are given in the second columns of Tables 3-8 under m = 1.

For n = 5, the six percentage points of u were independently determined at a later stage of the investigation by the method used above for n = 4, 6, 10, 16 and 20, and it is these values which are given in Tables 3-8. In the table below, the values obtained by interpolation from the framework values are compared with those determined by direct calculation.


Percentage points of u(1, 5)

α =                      0.10    0.01    0.002   0.001
By direct calculation            4.38    6.8     8.2
By interpolation                 4.38    6.8     8.2


In one case only, for $\alpha$ = 0.10, is there a difference as much as one unit in the last figure, the actual values obtained being 2.0192 by direct computation and 2.0198 by interpolation.

Taking all the various checks into consideration, it appears unlikely that the values of the percentage points of u(1, n) given in the tables at the end of the paper are in error by more than one unit in the last place. The values for n = 2 and n = 3 are, of course, exact.

(v) Computation of percentage points of the distribution of u = u(2, n), i.e. case with m = 2

The exact distribution of u(2, n) cannot be determined analytically except in the case of n = 2, and hence the various percentage points have necessarily to be evaluated almost wholly by numerical methods.

The probability of the joint occurrence of a pair of ranges $w'$ and $w''$ from random samples of equal size n from a normal population of unit standard deviation is given by

$$p(w', w'')\,dw'\,dw'' = p(w')\,p(w'')\,dw'\,dw''. \tag{25}$$

If $\bar{w}$ be the mean value of the two ranges, then its distribution is obtained by integrating (25):

$$p(\bar{w})\,d\bar{w} = \iint p(w')\,p(w'')\,dw'\,dw'', \tag{26}$$

the integration being taken over the whole field of $w'$ and $w''$, subject to the conditions

$$(0 \le w' < \infty,\quad 0 \le w'' < \infty). \tag{27}$$

We shall change the variables from $w'$ and $w''$ to $\bar{w}$ and $w'$, the Jacobian of the transformation being equal to 2. With these new variables, and noting from (27) that $w'$ varies from 0 to $2\bar{w}$, equation (26) gives

$$p(\bar{w}) = 2\int_0^{2\bar{w}} p(w')\,p(2\bar{w} - w')\,dw'. \tag{28}$$
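
The convolution in equation (28) lends itself to a numerical check. The following is a minimal sketch, not the computing scheme used for the tables: it assumes the standard result that the range of two unit-normal values has density $e^{-w^2/4}/\sqrt{\pi}$ for $w \ge 0$, evaluates (28) by Simpson's rule, and compares the result with the closed form $(4/\pi)\,e^{-\bar{w}^2/2}\int_0^{\bar{w}} e^{-z^2/2}\,dz$ that the same substitution yields analytically. The function names and the number of Simpson steps are illustrative choices.

```python
import math


def range_density_n2(w):
    """Density of the range of two unit-normal values (assumed standard result)."""
    return math.exp(-w * w / 4.0) / math.sqrt(math.pi) if w >= 0.0 else 0.0


def mean_of_two_ranges_density(w_bar, p=range_density_n2, steps=2000):
    """Equation (28): 2 * integral_0^{2 w_bar} p(w') p(2 w_bar - w') dw', by Simpson's rule."""
    if w_bar <= 0.0:
        return 0.0
    upper = 2.0 * w_bar
    h = upper / steps  # steps must be even for Simpson's rule

    def integrand(t):
        return p(t) * p(upper - t)

    total = integrand(0.0) + integrand(upper)
    for k in range(1, steps):
        total += (4.0 if k % 2 else 2.0) * integrand(k * h)
    return 2.0 * total * h / 3.0


def closed_form_n2(w_bar):
    """(4/pi) exp(-w^2/2) * integral_0^w exp(-z^2/2) dz, with the integral written via erf."""
    inner = math.sqrt(math.pi / 2.0) * math.erf(w_bar / math.sqrt(2.0))
    return (4.0 / math.pi) * math.exp(-w_bar * w_bar / 2.0) * inner


for w_bar in (0.5, 1.0, 2.0):
    print(w_bar, mean_of_two_ranges_density(w_bar), closed_form_n2(w_bar))
```

For other sample sizes the argument `p` would be replaced by tabulated ordinates of the range distribution, which is the situation the quadrature in the text addresses.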


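Special case n = 2, m = 2

In equation (11) above is given the distribution of the range in samples of two random values from a normal population with unit standard deviation. Substituting this in (28) above, the distribution of means of two independent values of w is therefore given by

$$p(\bar{w}) = 2\int_0^{2\bar{w}} \frac{1}{\pi}\,e^{-\frac14\{w'^2 + (2\bar{w} - w')^2\}}\,dw' = \frac{2}{\pi}\,e^{-\frac12\bar{w}^2}\int_0^{2\bar{w}} e^{-\frac12(w' - \bar{w})^2}\,dw'. \tag{29}$$

Using the notation

$$I(\bar{w}) = \int_0^{\bar{w}} \frac{1}{\sqrt{2\pi}}\,e^{-\frac12 z^2}\,dz,$$

the expression (29) for the distribution of the mean range may be written in the form

$$p(\bar{w}) = \frac{8}{\sqrt{2\pi}}\,e^{-\frac12\bar{w}^2}\,I(\bar{w}). \tag{30}$$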


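We may now proceed to determine the distribution of the ratio $q = x/\bar{w}$. Since they are independent, the joint distribution of x and $\bar{w}$ is, from (6) and (30), given by

$$p(x, \bar{w})\,dx\,d\bar{w} = \frac{4}{\pi}\,e^{-\frac12(x^2 + \bar{w}^2)}\,I(\bar{w})\,dx\,d\bar{w}. \tag{31}$$
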
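Transforming to variables q and $\bar{w}$, noting that the Jacobian of the transformation is equal to $\bar{w}$, and integrating, we obtain

$$p(q) = \frac{4}{\pi}\int_0^\infty e^{-\frac12\bar{w}^2(q^2+1)}\,I(\bar{w})\,\bar{w}\,d\bar{w}. \tag{32}$$

Now since

$$\frac{dI(\bar{w})}{d\bar{w}} = \frac{1}{\sqrt{2\pi}}\,e^{-\frac12\bar{w}^2},$$

we make use of the identity

$$\frac{d}{d\bar{w}}\left\{e^{-\frac12\bar{w}^2(q^2+1)}\,I(\bar{w})\right\} = -(q^2+1)\,\bar{w}\,e^{-\frac12\bar{w}^2(q^2+1)}\,I(\bar{w}) + \frac{1}{\sqrt{2\pi}}\,e^{-\frac12\bar{w}^2(q^2+2)}$$

to evaluate the integral in (32) and obtain

$$p(q) = \frac{4}{\pi\sqrt{2\pi}\,(q^2+1)}\int_0^\infty e^{-\frac12\bar{w}^2(q^2+2)}\,d\bar{w} = \frac{2}{\pi\,(q^2+1)\sqrt{q^2+2}}. \tag{33}$$

The ratio q is distributed symmetrically about zero and from (9) and (33) we obtain

$$1 - \alpha = \frac{4}{\pi}\int_0^{q_\alpha} \frac{dq}{(q^2+1)\sqrt{q^2+2}},$$

and the percentage points are therefore given by

$$q_\alpha = q_\alpha(2, 2) = \frac{\tau\sqrt{2}}{\sqrt{1 - \tau^2}}, \tag{34}$$

where $\tau = \tan\{\tfrac14\pi(1 - \alpha)\}$.

Substitution of $\alpha$ = 0.10, 0.05, etc., in (34) gives the required values of $q_\alpha$, and further multiplication by $d_2 = 2/\sqrt{\pi}$ gives the required corresponding percentage points of the distribution of u(2, 2).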
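
As an arithmetical illustration of the closed form for n = 2, m = 2, the sketch below evaluates $q_\alpha = \tau\sqrt{2}/\sqrt{1-\tau^2}$ with $\tau = \tan\{\tfrac14\pi(1-\alpha)\}$ and multiplies by $d_2 = 2/\sqrt{\pi}$. The function names are illustrative only; the formula is taken as read from (34), and the comparison values 2.6203 and 3.8671 are those printed for n = 2 in Table 2.

```python
import math


def q_alpha_22(alpha):
    """Percentage point of q for m = 2, n = 2, from the tangent form of equation (34)."""
    tau = math.tan(math.pi * (1.0 - alpha) / 4.0)
    return math.sqrt(2.0) * tau / math.sqrt(1.0 - tau * tau)


def u_alpha_22(alpha):
    """Percentage point of u(2, 2): q_alpha multiplied by d_2 = 2 / sqrt(pi)."""
    d2 = 2.0 / math.sqrt(math.pi)
    return d2 * q_alpha_22(alpha)


for alpha in (0.10, 0.05, 0.02, 0.01):
    print(alpha, round(u_alpha_22(alpha), 4))
```

The values for $\alpha$ = 0.10 and 0.05 agree with the first two entries of the n = 2 row of Table 2 to the four decimals printed there.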


General case n ≥ 3, m = 2

For n = 4, 6, 10, 16 and 20, the ordinates y(w) of the distribution of the range have been previously evaluated above at intervals of 0.25 for w, and these are used in place of the unknown p(w). Taking a particular value of $\bar{w}$, quadrature was then applied to the products $y(w')\,y(2\bar{w} - w')$ to obtain a numerical estimate of $p(\bar{w})$ from equation (28). This process was repeated at intervals of 0.25 for $\bar{w}$ through as much of the range $0 \le \bar{w} < \infty$ as was necessary to obtain the required degree of accuracy. For n = 3, determination of the ordinates of the distribution of $\bar{w}$ was also based on quadrature, but, in this case, exact figures for the ordinates of the distribution of the range were obtained from its equation found by McKay & Pearson (1933). Because of the rapid rise of the distribution near to the origin, estimates
of the lower values of p(I^) in this case were determined using an interval of ^ for w in order 
to obtain the requisite accuracy. For higher values of w the interval was progressively 
decreased to 0^26 used over the tail portion of the curved 
Treating in turn each curve of the distribution of $\bar{w}$ (for n = 3, 4, 6, 10, 16 and 20), the values of the percentage points of the distribution of the ratio $q = x/\bar{w}$ were computed by a method similar to that used in evaluating the percentage points of q for the case m = 1. Taking trial values of $q_\alpha$, the integrals $I(\bar{w}, q_\alpha)$ were calculated at intervals of 0.25 for $\bar{w}$. Quadrature was then applied to the products $y(\bar{w})\,I(\bar{w}, q_\alpha)$ over the range $0 \le \bar{w} < \infty$ to obtain corresponding values of $(1 - \alpha)$, where $\alpha$ is the sum of the two tails of the distribution beyond deviations $\pm q_\alpha$. Repeating this procedure, a series of values of $(1 - \alpha)$ were obtained, corresponding to a set of equidistant values of $q_\alpha$. Backward interpolation was then used to obtain the six values of $q_\alpha$ corresponding to the six values of $\alpha$ under consideration. Finally, the required percentage points of u were obtained by multiplication by the appropriate value of $d_n$ and are given in Table 2, together with the exact values for n = 2 determined from (34).

Table 2. Framework values of percentage points of u = u(2, n)

                              α
  n     0.10      0.05      0.02     0.01     0.002    0.001
  2     2.6203    3.8671    6.266    8.932    20.10    28.47
  3     2.0201    2.6366+   3.665+   4.340     6.78     8.66
  4     1.8760    2.3672    3.047    3.600     5.07     6.80
  6     1.7791    2.1920    2.727    3.133     4.11     4.62
 10     1.7226+   2.0926    2.551    2.884     3.64     3.96
 16     1.6961    2.0470    2.472    2.775+    3.44     3.71
 20     1.6879    2.0329    2.449    2.742     3.38     3.64

As before, Lagrangian formulae were used to interpolate the intermediate values of the percentage points of u, again taking arguments proportional to 1/n and reciprocals of the percentage points for the variate. As a check, the values were inspected by determining differences up to the third order and then reduced by one place of decimals. With the exception of the values for n = 5, the reduced values are given in Tables 3-8.

For n = 5 the six percentage points of u were independently determined at a later stage of the investigation by the same methods used for the framework values. These directly computed values are given in the tables mentioned above, and are also reproduced below for comparison with those obtained by interpolation from the framework values.


Percentage points of u = u(2, 5)

α =                      0.10     0.05     0.02    0.01    0.002   0.001
By direct calculation    1.814    2.254    2.84    3.29    4.4     5.0
By interpolation         1.814    2.254    2.84    3.29    4.4     4.9

In the case of $\alpha$ = 0.001, direct calculation gives a value of 4.97 compared with 4.92 obtained by interpolation. For other percentage levels the agreement is exact to the number of figures quoted.


(vi) Computation of percentage points of the distribution of u = u(m, n) for m > 2

The variance of the mean range in m subgroups of equal size n steadily decreases as m increases, and the ratio $\bar{w}(m, n)/d_n$ gives closer estimates of the population value of the standard deviation of the variate. Hence, following the usual methods of large-sample theory, the limiting values of the percentage points of u, for indefinitely large m, may be determined from integral tables of the normal curve. For a given value of $\alpha$, the limiting values of the percentage points are, of course, equal for all values of n and are also equal to the corresponding limiting values of the percentage points of Fisher's t-distribution for an indefinitely large number of degrees of freedom.

In general it was found for a particular value of $\alpha$ and of n that a three-point Lagrangian curve with 1/m as argument and reciprocals of the percentage points of u as variate (passing through points corresponding to m = 1, 2 and ∞) may be used for interpolation of the required percentage points corresponding to values of m intermediate between 2 and ∞. Only in the case of n = 2 and n = 3 was the required accuracy not attained by this procedure and further investigation found necessary. Details of the methods used are given below.
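
The three-point scheme just described can be sketched as follows. This is an illustrative outline only: the helper names are invented here, and the three input percentage points in the usage lines are hypothetical placeholders, not values from Tables 3-8. The argument is 1/m (with 1/m = 0 standing for m = ∞) and the interpolated quantity is the reciprocal of the percentage point, as in the text.

```python
def lagrange(xs, ys, x):
    """Evaluate the Lagrange polynomial through the points (xs[i], ys[i]) at x."""
    total = 0.0
    for i, (xi, yi) in enumerate(zip(xs, ys)):
        term = yi
        for j, xj in enumerate(xs):
            if j != i:
                term *= (x - xj) / (xi - xj)
        total += term
    return total


def interpolate_percentage_point(m, u_m1, u_m2, u_inf):
    """Three-point interpolation in 1/m on reciprocals of the percentage points."""
    arguments = [1.0, 0.5, 0.0]                     # 1/m for m = 1, 2 and infinity
    variates = [1.0 / u_m1, 1.0 / u_m2, 1.0 / u_inf]
    return 1.0 / lagrange(arguments, variates, 1.0 / m)


# Hypothetical inputs for one (alpha, n) pair; 1.96 is the normal 5% limit.
for m in (3, 4, 6, 10):
    print(m, round(interpolate_percentage_point(m, 2.30, 2.10, 1.96), 3))
```

Working with reciprocals keeps the point at m = ∞ at a finite argument, which is presumably why the text reports smoother differences in these variables.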

In the case of n = 2, the percentage points of the distribution of u(m, 2) were also determined for m = 4 and m = 8 as follows. First, considering m = 4, it was necessary to obtain numerical estimates of the ordinates of the distribution of the means of four ranges, each range from a random sample of two values from a normal population with unit standard deviation. Following the method used for m = 2 and leading to equation (28), it is easy to show that the distribution of $\bar{w}(2m, n)$, the mean of 2m independent ranges each from a random subsample of n values, is given in terms of the distribution of the mean, $\bar{w}(m, n)$, of m such ranges by

$$p\{\bar{w}(2m, n)\} = 2\int_0^{2\bar{w}(2m, n)} p\{\bar{w}(m, n)\}\,p\{2\bar{w}(2m, n) - \bar{w}(m, n)\}\,d\bar{w}(m, n). \tag{35}$$

Using numerical values for $p(\bar{w})$ (m = 2, n = 2) given by equation (29) above, estimates of the ordinates of the distribution $p(\bar{w})$ for m = 4, n = 2 at intervals of 0.25 were found by quadrature methods similar to those described in previous sections by using the above expression. A repetition of this process, using these last computed values, yielded numerical estimates of the distribution of the means of eight ranges, i.e. m = 8. Again applying quadrature to the two distributions, values of $(1 - \alpha)$ were determined for a series of equidistant values of $q_\alpha$ (m = 4 and m = 8). The required values of $q_\alpha$, corresponding to the six values of $\alpha$ between 0.10 and 0.001 under consideration, were then obtained by backward interpolation, and hence the percentage points of u(4, 2) and u(8, 2).

The sets of six percentage points of u(m, 2) were determined for each required value of m by Lagrangian interpolation, reciprocals of m being used as argument and reciprocals of the corresponding percentage points as variate, the curve passing through the points corresponding to m = 1, 2, 4, 8 and ∞. The interpolated values of percentage points obtained by this method, and the directly computed values for m = 4 and m = 8, are given in Tables 3-8 for a series of values of m suitable for practical use. As a check upon the method, the computation was repeated, this time using a four-point Lagrangian passing through points corresponding to m = 1, 2, 4 and ∞. Most of these interpolated four-point values of the percentage points agree exactly with the five-point values previously obtained. In cases where differences arise, none exceeds 1 unit in the last figure. The five-point Lagrangian method of interpolation therefore appears to be quite adequate for furnishing the required degree of accuracy.

Numerical estimates of the distribution of the means of pairs of ranges from subsamples of size n = 3 and n = 4 have already been obtained above. Using these in turn in equation (35), estimates of the ordinates of the distributions of the means of four ranges were determined at intervals of 0.25 by quadrature methods. The sets of percentage points of u(4, 3) and u(4, 4) were then computed by the previous method of trial values and subsequent backward interpolation.

For n = 3 and n = 4, the percentage points of the distribution of u, for given values of m, are lower and nearer to their limiting values than the corresponding points for n = 2. Furthermore, the changes in the values of the percentage points for small values of m are also less abrupt. In view of the agreement between the four-point and five-point Lagrangian interpolated values of the percentage points for n = 2, a four-point Lagrangian through points corresponding to m = 1, 2, 4 and ∞ may be relied upon to give adequate accuracy for the interpolation of percentage points corresponding to intermediate values of m in the case of n = 3 and n = 4. These values, together with the computed values for m = 4, are given in Tables 3-8.

For the remaining values of n, from 5 to 20, the interpolated percentage points given in Tables 3-8 have been obtained by means of a three-point Lagrangian curve, using values of the percentage points corresponding to values of m = 1, 2 and ∞. As in the previous cases of interpolation, reciprocals of m and the variate were used in order to obtain small changes in successive differences. To show that this method is adequate, the six sets of percentage points for n = 4 were also interpolated using a three-point Lagrangian. In every case except one, these values agreed exactly with the four-point Lagrangian interpolated values previously found and given in Tables 3-8. In the case of the sole exception, the difference between the two interpolated values was only one unit in the last figure. For the less rapidly changing values of the percentage points of u for n from 5 to 20, the three-point method of interpolation therefore provides sufficient accuracy for the present purpose.

Taking all checks into consideration, it appears that the tabulated values of the percentage points of the distribution of the function u = u(m, n) may be relied upon to the accuracy given: occasionally the values may be one unit in error in the last figure.

In Tables 3 and 4, the 10% and 5% points of u were computed to 3 decimal places, but lack of space has necessitated these being curtailed for publication. For the same reason the values of the percentage points of u for the odd values of n = 11, 13, ..., 19 have been omitted. In practical applications of the test, it is not considered that this reduction will cause any undue inconvenience. Fuller tables have, however, been retained for forthcoming work on the power of the u-test and are available for consultation if required.

(vii) Approximate values of the percentage points of u

If there are m subgroups each of n values, and if the estimate of standard deviation is determined as the root-mean-square of the deviations of variate values from the respective means of the subgroups, then the number of degrees of freedom is ν = m(n - 1). Unlike those of the usual t-test, the percentage points of the modified t-distribution investigated above, in which the estimate of standard deviation is determined from the mean range in m subgroups of equal size n, depend upon the relation between m and n. Reference to Tables 3-8 indicates that, for a constant number of degrees of freedom ν = m(n - 1), the values of the percentage points on a given probability level vary slightly as m and n vary. For example, taking $\alpha$ = 0.05 and ν = 8, we have the following percentage points: 2.272 for m = 1 and n = 9, 2.254 for m = 2 and n = 5, 2.250 for m = 4 and n = 3, and 2.264 for m = 8 and n = 2. In general, however, the range in the values of the percentage points of u for a given value of ν is small, and this permits the construction of a table giving approximate values of the six sets of percentage points corresponding to different numbers of degrees of freedom.


Approximate values of percentage points of u

Degrees of freedom                    Values of α
  ν = m(n - 1)      0.10    0.05    0.02    0.01    0.002    0.001
        1           5.0     10.1    25.4    50.8    254.0    507.9
        2           2.6     3.8     6.2     8.9     19.9     28.3
        3           2.2     3.0     4.2     5.6     9.4      11.8
        4           2.0     2.6     3.6     4.4     6.8      8.3
        5           1.9     2.5     3.3     3.9     5.7      6.7
        6           1.9     2.4     3.1     3.6     5.1      5.9
        7           1.8     2.3     3.0     3.5     4.7      5.4
        8           1.8     2.3     2.9     3.3     4.5      5.0
        9           1.8     2.2     2.8     3.2     4.3      4.8
       10           1.8     2.2     2.7     3.1     4.2      4.6
       11           1.8     2.2     2.7     3.1     4.1      4.6
       12           1.8     2.2     2.7     3.1     4.0      4.3
       13           1.8     2.2     2.7     3.1     3.9      4.3
       14           1.7     2.1     2.6     3.0     3.8      4.2
       15           1.7     2.1     2.6     2.9     3.7      4.1
       16           1.7     2.1     2.6     2.9     3.7      4.1
       17           1.7     2.1     2.6     2.9     3.7      4.1
       18           1.7     2.1     2.6     2.9     3.6      4.0
       19           1.7     2.1     2.6     2.9     3.6      4.0
       20           1.7     2.1     2.6     2.8     3.6      3.9
       30           1.7     2.0     2.4     2.7     3.4      3.6
       60           1.7     2.0     2.4     2.7     3.2      3.5
      120           1.7     2.0     2.4     2.6     3.2      3.4
       ∞            1.64    1.96    2.33    2.58    3.09     3.29


For a particular pair of values of m and n, the values of the percentage points for ν = m(n - 1) degrees of freedom given in the table above are generally not in error by more than one unit in the last place of figures. This degree of accuracy is sufficient for many practical applications of the distribution of u. To settle the significance of cases giving values of u close to the above approximate values, reference should be made to the accurate values given in Tables 3-8.


(viii) Applications of the u-test

The difference between the mean of a sample of n random values of a normally distributed variate and the population value is shown in the Appendix to be independent of the total range in the sample, and also independent of the mean range determined from random subgroups of values. The modified t-test based on range estimates of standard deviation may therefore be used in various statistical tests of significance involving deviations of sample means. The application of this range test to sampling problems is analogous to that of the well-known t-test, and no detailed description is therefore required. The most frequent use of the new test will be found in the treatment of experimental data of various types, and also in the examination of test results recorded for the purpose of control of the quality of industrial products. In this latter type of work, cases frequently arise when it is desirable to apply a rapid test for determining the significance of a difference between the mean of a sample and some preassigned value, frequently some desired control level, or the significance of the difference between two sample means. Furthermore, for routine purposes, it is often desirable that the test should not only be rapid but also of a simple nature, thus enabling it to be used by workers with little mathematical or even arithmetical aptitude. The new range test has the advantages of greater simplicity and a greatly reduced amount of computing compared with the standard t-test. The use of range estimates of standard deviation, in place of root-mean-square estimates, necessarily entails some loss of precision, but in a future paper it will be shown that this reduction in accuracy is small and certainly negligible for most practical purposes.

The most frequent applications of the range test are considered below and are followed by several numerical examples in which, for purposes of comparison, the parallel treatment by the t-test is also given. As in the t-test, the application of the range test involves the assumption of normality of variate distribution and randomness of sampling. Furthermore, where the standard deviation is estimated from the mean range of several subgroups of values, care should be taken to ensure that the arrangement of these values is also random. This latter condition is usually fulfilled by considering the values in the order in which they were originally recorded. In a few cases, however, the order of recording may not be random; the particular circumstances of a test may be such that the order of the observations may be wholly or partly dependent upon their magnitude. In such cases a set of values can be divided into random subgroups by the use of tables of random sampling numbers or by other means.


(a) Difference between sample mean and population mean

Suppose we have some preassigned value $\xi$ and wish to test whether the mean $\bar{x}$ of a sample of N values may be considered as a reasonable estimate of $\xi$, or whether the difference between $\bar{x}$ and $\xi$ is real in the statistical sense. The usual assumption, the so-called 'Student's Hypothesis', is made that $\bar{x}$ is the mean of a random sample from a normal population of which the mean is $\xi$ and standard deviation is $\sigma$. The differences $(\bar{x} - \xi)$ will be distributed about a mean of zero with a standard error equal to $\sigma/\sqrt{N}$. If the sample be divided into m random subgroups of equal size n, N = mn, and $\bar{w}$ is the mean of the m ranges of the subgroups, then the sample estimate of the standard error of the mean is $\bar{w}/(d_n\sqrt{N})$. The ratio of the difference between the means to the estimate of its standard error is

$$u = \frac{(\bar{x} - \xi)\,d_n\sqrt{N}}{\bar{w}}. \tag{36}$$

If the computed value of u exceeds the corresponding percentage point in one of Tables 3-8, then the difference is considered unlikely to have arisen through random sampling on that particular probability level $\alpha$. As in the case of the t-test, when considering the asymmetrical case of 'Student's Hypothesis', the values of $\alpha$ at the headings of the tables should be halved.
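
The computation of u for a single sample may be sketched as below. This is an illustration under stated assumptions rather than a prescribed routine: the function name is invented here, consecutive values are taken to form the random subgroups, and the constants $d_n$ are the commonly tabulated mean ranges of n unit-normal values (quoted to three decimals from standard tables, not from this paper's Table 11).

```python
import math

# Mean range d_n of n values from a unit normal population (standard tabulated constants).
D_N = {2: 1.128, 3: 1.693, 4: 2.059, 5: 2.326, 6: 2.534,
       7: 2.704, 8: 2.847, 9: 2.970, 10: 3.078}


def u_statistic(values, xi, n):
    """Ratio (x_bar - xi) * d_n * sqrt(N) / w_bar for subgroups of n consecutive values."""
    total = len(values)
    if total == 0 or total % n != 0:
        raise ValueError("sample size must be a positive multiple of the subgroup size")
    groups = [values[i:i + n] for i in range(0, total, n)]
    w_bar = sum(max(g) - min(g) for g in groups) / len(groups)  # mean of the m ranges
    x_bar = sum(values) / total
    return (x_bar - xi) * D_N[n] * math.sqrt(total) / w_bar


# Small artificial sample: two subgroups of two values, tested against xi = 0.
print(u_statistic([1.0, 2.0, 3.0, 4.0], 0.0, 2))
```

The result would then be compared with the percentage point in the column of Tables 3-8 for the same m and n.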

For fairly small values of N, the estimate of the standard error of the mean may be determined, not from the mean range in subgroups, but from the total range between the maximum and minimum values in the sample. In the notation used above, this corresponds to m = 1 and n = N. The test of the significance of the difference may be made as above and the computed value of u compared with the percentage points in Tables 3-8. In these cases, however, the computation may be curtailed by using the ratio

$$\frac{\delta}{w}, \tag{37}$$

where $\delta = |\bar{x} - \xi|$, and w is the range in the undivided sample. Table 9 gives values of the ratio $\delta/w$ for various levels of significance corresponding to the sum of the two tails of the distribution. For a chosen level of significance the difference $\delta$ is considered too large to have arisen through random sampling errors if the value of $\delta/w$ exceeds the corresponding tabulated value. Table 9 will also be found useful for giving a rapid estimate of the accuracy of the mean based on a small number of observations.


(b) Difference between two sample means

Suppose the first sample of size $N_1$ be divided into $m_1$ random subgroups of size n, and the second sample of size $N_2$ be divided into $m_2$ random subgroups also of size n, i.e.

$$n = N_1/m_1 = N_2/m_2.$$

The hypothesis is made that each sample can be considered as a random selection from the same normal population. Let the numerical value of the difference between the two sample means be $|\bar{x}_1 - \bar{x}_2|$, and the mean of the $(m_1 + m_2)$ ranges of n values be $\bar{w}(m_1 + m_2, n)$, giving an estimate $\bar{w}/d_n$ for the standard deviation of the variate. The ratio of the difference between the two sample means to the range estimate of the standard error of the difference is

$$u = \frac{|\bar{x}_1 - \bar{x}_2|\,d_n}{\bar{w}\sqrt{1/N_1 + 1/N_2}}. \tag{38}$$

The significance of the difference between the means in any particular case can be determined by noting whether the computed value of u exceeds the corresponding percentage point for a chosen value of $\alpha$ by reference to Tables 3-8, using the column headed $m = m_1 + m_2$. When the samples are small and of equal size, say n, the variate standard deviation can be estimated from the two total ranges in the samples. If $w'$ and $w''$ are the two ranges, with a mean value $\bar{w} = \tfrac12(w' + w'')$, then

$$u = \frac{|\bar{x}_1 - \bar{x}_2|\,d_n\sqrt{\tfrac12 n}}{\bar{w}} \tag{39}$$

may be used as above for testing the significance of the difference between the two means. A more rapid test may, however, be made by simply determining the value of the ratio of the difference between sample means to the average of the two sample ranges

$$\frac{|\bar{x}_1 - \bar{x}_2|}{\tfrac12(w' + w'')} = \frac{u}{d_n\sqrt{\tfrac12 n}}. \tag{40}$$

In Table 10 are given values of the above ratio lying on six different probability levels. For 
a given level of significance $\alpha$, values of the ratio smaller than those tabulated may be considered to have arisen through random sampling errors; greater values indicate that a given difference is unlikely to have arisen through chance and therefore point to a real difference.

In the computation of u it is necessary to use values of the mean range in samples from
a normal population of unit standard deviation. A selection of the values determined by
Tippett (1925) is reproduced in Table II to avoid the necessity of frequent reference to his 
original paper, and is accompanied by the corresponding values of aJti and 

(c) Confidence intervals

As with 'Student's' test, the tables of percentage points may be used to estimate, with a given measure of confidence, the interval within which it can be stated that $\xi$ or $\xi_1 - \xi_2$ lies.

Examples

Example 1. The following data have been previously used as an example by ‘Student’ 
(1908). Ten patients were treated with the optical isomers of hyoscyamine hydrobromide 
and the additional hours of sleep were noted. 


Additional hours sleep gained by use of hyoscyamine hydrobromide

Patient    Dextro-(D)    Laevo-(L)    Difference (D - L)
   1         +0.7          +1.9           +1.2
   2         -1.6          +0.8           +2.4
   3         -0.2          +1.1           +1.3
   4         -1.2          +0.1           +1.3
   5         -0.1          -0.1            0.0
   6         +3.4          +4.4           +1.0
   7         +3.7          +5.5           +1.8
   8         +0.8          +1.6           +0.8
   9          0.0          +4.6           +4.6
  10         +2.0          +3.4           +1.4
Means        +0.75         +2.33          +1.58


The last column may be used for the controlled comparison of the two drugs, since their 
effects were measured on the same ten patients. The laevo form has given a greater figure for 
the additional hours sleep than the dextro form. Whether the former may be considered as 
the better soporific is examined by both the standard deviation and range tests. 

(a) The sum of squares of deviations of the differences about their mean value is 13.616, associated with 9 degrees of freedom. The estimate of the standard error is therefore 0.3890, and the value of t works out to be 1.58/0.3890 = 4.06. For 9 degrees of freedom a value of t = 3.250 lies on the 1% level of significance. Assuming normal random sampling, a value of t equal to or greater than 4.06 will occur much less frequently than once in a hundred times. This leads to the conclusion that the laevo form is better for producing sleep than the dextro form.

(b) For examination by the range, the value of u = u(1, 10) may be computed, but in this case it is simpler to use the shortened method of equation (37). The ratio of the mean difference to the range in the ten individual differences is $\delta/w$ = 1.58/4.6 = 0.34. Reference to Table 9 shows that this value is slightly in excess of the tabulated value 0.333 on the 1% level of significance, leading to the same conclusion as that drawn from the t-test.

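
The arithmetic of Example 1 can be retraced with a few lines of code. The sketch below uses the ten differences as read from the table above; the variable names are illustrative, and the critical values 3.250 and 0.333 are simply those quoted in the text.

```python
import math

# Differences in additional hours of sleep for the ten patients, as tabulated.
differences = [1.2, 2.4, 1.3, 1.3, 0.0, 1.0, 1.8, 0.8, 4.6, 1.4]

count = len(differences)
mean_diff = sum(differences) / count
sum_sq = sum((d - mean_diff) ** 2 for d in differences)

# (a) Standard deviation test: standard error of the mean and Student's t.
std_error = math.sqrt(sum_sq / (count - 1) / count)
t_value = mean_diff / std_error

# (b) Range test, shortened form of equation (37): mean difference over total range.
sample_range = max(differences) - min(differences)
ratio = mean_diff / sample_range

print(round(sum_sq, 3), round(std_error, 4), round(t_value, 2), round(ratio, 2))
print("t exceeds 3.250:", t_value > 3.250, " ratio exceeds 0.333:", ratio > 0.333)
```

Both comparisons come out on the same side of their 1% values, which is the agreement the example reports.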
The greater significance suggested by the t-test seems to be largely due to the exceptional difference D - L for Patient No. 9, viz. 4.6, which affects s more seriously than w.

Example 2. In the calibration of a viscometer it is necessary to time the interval required for the level of an aqueous solution of glycerol to fall between two fixed marks. For satisfactory calibration it is considered desirable that the mean time of flow should be accurate to ±½ sec., risking a greater error not more frequently than 1 in 20 times. Five independent determinations of the time interval (in seconds) for one viscometer were 103.5, 104.1, 102.7, 103.2 and 102.6. While this number of observations is clearly too small for a final assessment of accuracy, it is often useful to get an interim answer to guide further action.

(a) The sum of squares of the deviations of the five observations about their mean is 1.508, associated with 4 degrees of freedom, giving an estimate of the standard error of the mean equal to 0.275. Reference to tables shows that a value of t equal to 2.776 lies on the 5% level of significance. Hence in 19 times out of 20 it would be expected that a sample mean will not diverge from the true mean value by more than ±2.776 × 0.275 = ±0.76 sec. This error exceeds the assigned limits of ±½ sec. and therefore points to the necessity of further tests to fulfil the required conditions.

(b) Instead of computing an estimate of the standard error of the mean from the range (w = 104.1 - 102.6 = 1.5) in the five determinations, we note from Table 9 that a value of $\delta/w$ = 0.507 lies on the 5% level of significance. Hence in 19 times out of 20 the sample mean will differ from its true value by an amount up to a deviation of

±$\delta$ = ±0.507 × 1.5 = ±0.76,

a result in agreement with that yielded by the t-test.
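
The two limits in Example 2 can be recomputed directly from the five timings. In this sketch the constants 2.776 and 0.507 are the tabulated values quoted in the example, and the variable names are illustrative.

```python
import math

# Five determinations of the time of flow, in seconds.
times = [103.5, 104.1, 102.7, 103.2, 102.6]

count = len(times)
mean_time = sum(times) / count
sum_sq = sum((t - mean_time) ** 2 for t in times)
std_error = math.sqrt(sum_sq / (count - 1) / count)

T_5PCT_4DF = 2.776      # 5% point of t for 4 degrees of freedom, as quoted
RATIO_5PCT_N5 = 0.507   # 5% value of delta/w for n = 5, as quoted from Table 9

limit_from_t = T_5PCT_4DF * std_error
limit_from_range = RATIO_5PCT_N5 * (max(times) - min(times))

print(round(sum_sq, 3), round(std_error, 3))
print(round(limit_from_t, 2), round(limit_from_range, 2))
```

The range-based limit needs only one subtraction and one multiplication, which is the saving in labour the example is meant to show.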

Example 3. In the processing of raw cotton, modifications were made in the design of one of the machines with the object of improving the efficiency of cleaning. Tests were made on a series of 24 different mixings for the purpose of determining whether yarn strength was adversely affected by the mechanical alterations. The results of the 24 pairs of comparisons are given below (the strength being expressed as a count × strength product), together with the differences between them expressed as percentages of the corresponding strengths under standard conditions.


Yarn strengths under standard and modified conditions

The mean value for the percentage difference in strength is -0.46. Whether this is an indication that the mechanical modifications have resulted in the production of weaker yarns is examined by means of the standard deviation and range tests.

(a) The sum of squares of the deviations of the percentage differences about their mean value is 256.48, based on 23 degrees of freedom. The estimate of the standard deviation of the percentage differences is 3.34 and the standard error of their mean value is 0.68, giving a value of t = 0.46/0.68 = 0.69. This is much below the value of 2.069 on the 5% level of significance and leads to the conclusion that there are no grounds for suspecting that the mechanical alterations have led to the production of weaker yarns.

(b) The number of observations places this case outside the range of Table 9, and it is therefore necessary to use the modified t-function. The data are arranged in random order of their occurrence, and split into four groups of six. The ranges in the sets of six differences are 7.5, 9.8, 12.8 and 5.9 with a mean value $\bar{w}(4, 6)$ = 9.0. The estimate of the variate standard deviation is $\bar{w}(4, 6)/d_6$ = 3.55, giving 0.72 for the standard error of the mean percentage difference, and u = 0.46/0.72 = 0.64. The 5% level of significance is, from Table 4, equal to 2.07, much greater than the value computed from the data, and therefore indicates the same conclusion as above.
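
The range computation of Example 3 may be followed step by step as below. The sketch assumes $d_6$ = 2.534, the commonly tabulated mean range for samples of six, and takes the four subgroup ranges and the mean difference of 0.46 from the text; the 24 individual differences are not needed.

```python
import math

ranges = [7.5, 9.8, 12.8, 5.9]   # ranges of the four subgroups of six differences
total_observations = 24
D_6 = 2.534                      # assumed tabulated mean range for n = 6
mean_difference = 0.46           # numerical value of the mean percentage difference

mean_range = sum(ranges) / len(ranges)
sd_estimate = mean_range / D_6
std_error = sd_estimate / math.sqrt(total_observations)
u_value = mean_difference / std_error

print(round(mean_range, 1), round(sd_estimate, 2), round(std_error, 2), round(u_value, 2))
```

Carrying the unrounded standard error through gives u close to 0.63; the figure 0.64 in the text follows from dividing by the rounded value 0.72. Either way the result lies far below the tabulated 2.07.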

Example 4. Independent determinations of percentage trash content were made in tri¬ 
plicate on two samples of raw cotton and the following results obtained: 


Percentage trash content of raw cotton 


Sample A 

Sample B 

M3 

0-76 

1-31 

0-64 

1*26 

1*01 

Means 1*23 

0*80 


The point to be decided is whether sample B may be said to be cleaner than sample Aj 
or whether the difference between the two average percentage trash contents may be 
accounted for by random experimental variation. Since, in this case, the comparisons are 
not paired, the standard error of the difference between the mean values of the two samples 
has necessarily to be estimated from the variation within each of the two sets of results. 
As before, normal variation in sampling and in testing errors is assumed. 

(a) The sum of squares of deviations of each set of values from their mean is 0.01680 for A
and 0.07167 for B, each associated with 2 degrees of freedom. The best estimate of the error
standard deviation is therefore 0.149, giving 0.122 for the estimate of the standard error
of the difference between the two means. The value of t is equal to (1.23 - 0.80)/0.122 = 3.5,
which exceeds the value 2.776 obtained from tables for 4 degrees of freedom and α = 0.05.
On this level of significance the result is taken to indicate a real difference in the cleanliness
of the two cottons.

(b) The difference between the two sample means is 0.43 and the mean of the two ranges
is 0.275. Hence, using the ratio of equation (40),

    (difference between means)/(mean range) = 0.43/0.275 = 1.56

which, from Table 10, is seen to exceed the value of 1.272 lying on the 5 % level and
therefore is taken to indicate a significant difference in the mean values.
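
As a hedged illustration of (b), the ratio can be formed directly from the six determinations. The values below are the tabulated ones as read from the table of Example 4; the critical value 1.272 is the Table 10 figure quoted in the text, and `range_ratio` is a hypothetical helper name.

```python
# Percentage trash content, as tabulated in Example 4.
SAMPLE_A = [1.13, 1.31, 1.25]
SAMPLE_B = [0.76, 0.64, 1.01]
CRITICAL_5_PERCENT = 1.272   # value quoted in the text from Table 10


def mean(values):
    return sum(values) / len(values)


def sample_range(values):
    return max(values) - min(values)


def range_ratio(a, b):
    """(difference between the two means) / (mean of the two ranges)."""
    mean_range = (sample_range(a) + sample_range(b)) / 2
    return abs(mean(a) - mean(b)) / mean_range


ratio = range_ratio(SAMPLE_A, SAMPLE_B)
print(f"ratio = {ratio:.2f}, significant at 5 %: {ratio > CRITICAL_5_PERCENT}")
```

The unrounded ratio differs slightly from the quotient of the rounded figures 0.43 and 0.275, but lies well above the tabulated level in either case.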




Range in place of standard deviation in the t-test

Example 5. The following strength test results were obtained on two batches of cotton
yarn (measurements recorded to the nearest ½ lb.) and are noted downwards in order of
random occurrence:


            Sample A                  Sample B

     30.5    31.0    29.5          27.0    28.5
     28.0    31.5    27.5          28.5    25.0
     29.5    30.0    28.0          26.5    28.0
     28.0    27.5    26.0          27.0    27.5
     28.5    29.5    28.5          27.0    27.5
     29.5    27.5    26.5          28.5    28.5
     27.5    28.0    27.0          28.0    28.0
     28.0    32.5    30.0          28.0    26.0
     29.5    28.5    28.5          25.0    26.5
     30.5    29.0    31.0          29.0    28.0


The mean of sample A is 28.90 lb. and that of sample B is 27.40 lb., and the question arises
as to whether sample B is actually weaker than A.

(a) The sums of squares of the deviations about their respective mean values are 68.2
for A and 24.8 for B, associated with 29 and 19 degrees of freedom. The estimate of the error
standard deviation is therefore 1.392, giving 0.402 for the standard error of the difference
between the two means and a value of t equal to (28.9 - 27.4)/0.402 = 3.7. For 48 degrees of
freedom a value of t = 2.68 lies on the 1 % level of significance. The greater value of 3.7
yielded by the data above indicates, therefore, that the difference in strength of the two yarns
may be accepted as statistically significant.

(b) The estimate of the error standard deviation is obtained from the ranges within groups
of ten values, three groups for sample A and two for sample B. The values of these five
ranges are 3.0, 5.0, 5.0, 4.0 and 3.5, with a mean value of 4.1 and a corresponding estimate of
error standard deviation equal to w(5,10)/d_10 = 1.33. The estimate of the standard error of
the difference between the two means is 0.384 and the value of u is (28.9 - 27.4)/0.384 = 3.9.
For five ranges of ten, the value of u on the 1 % level of significance is, from Table 6, equal to
2.69 (cf. 2.68 for t with 48 degrees of freedom). The value of 3.9 obtained from the data is
greater than this value of 2.69 and this again leads to the conclusion that the difference in
mean strengths of the two yarns is 'statistically significant'.
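
Both calculations of Example 5 can be retraced from the fifty readings. The sketch below assumes the readings are taken column by column (groups of ten in order of occurrence) and uses d_10 = 3.0775 from Table 11; all names are illustrative.

```python
import math

# Strength readings (lb.), read downwards column by column from the table.
SAMPLE_A = [30.5, 28.0, 29.5, 28.0, 28.5, 29.5, 27.5, 28.0, 29.5, 30.5,
            31.0, 31.5, 30.0, 27.5, 29.5, 27.5, 28.0, 32.5, 28.5, 29.0,
            29.5, 27.5, 28.0, 26.0, 28.5, 26.5, 27.0, 30.0, 28.5, 31.0]
SAMPLE_B = [27.0, 28.5, 26.5, 27.0, 27.0, 28.5, 28.0, 28.0, 25.0, 29.0,
            28.5, 25.0, 28.0, 27.5, 27.5, 28.5, 28.0, 26.0, 26.5, 28.0]
D10 = 3.0775   # d_n for n = 10, as tabulated in Table 11


def mean(values):
    return sum(values) / len(values)


def sum_of_squares(values):
    m = mean(values)
    return sum((v - m) ** 2 for v in values)


def group_ranges(values, size):
    """Ranges of consecutive groups of `size` readings."""
    return [max(values[i:i + size]) - min(values[i:i + size])
            for i in range(0, len(values), size)]


n_a, n_b = len(SAMPLE_A), len(SAMPLE_B)
diff = mean(SAMPLE_A) - mean(SAMPLE_B)
scale = math.sqrt(1 / n_a + 1 / n_b)

# (a) root-mean-square estimate and the ordinary t.
ss_a, ss_b = sum_of_squares(SAMPLE_A), sum_of_squares(SAMPLE_B)
pooled_sd = math.sqrt((ss_a + ss_b) / (n_a + n_b - 2))
t = diff / (pooled_sd * scale)

# (b) range estimate from five groups of ten and the ratio u.
ranges = group_ranges(SAMPLE_A, 10) + group_ranges(SAMPLE_B, 10)
range_sd = mean(ranges) / D10
u = diff / (range_sd * scale)

print(f"t = {t:.2f}, u = {u:.2f}")
```

The two estimates of standard deviation differ by a few per cent, yet both ratios lie well beyond their respective 1 % levels.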

Note added in proof. Since the present paper went to press, a note by Daly (1946) has
been published, in which it is suggested that the range may be used in place of the
root-mean-square estimate of variance in a test analogous to the t-test. The case where the
estimate of standard deviation is obtained from a single range is discussed and values of
the ratio (deviation)/(range) on the 10 % level of significance are given to two significant
figures for a number of low values of n. These agree with the corresponding values given in
Table 9, for α = 0.10, of the present paper. [Mr Lord's paper was first submitted for
publication in August 1945. Ed.]

APPENDIX


On the independence of mean and some linear estimates of standard
deviation in random samples from a normal population

In the above practical applications of the u distribution to normal random sampling
problems, it has been implicitly assumed that range estimates of standard deviation, like
root-mean-square estimates, are independent of the mean of the sample from which they have
been determined. The validity of this assumption is established below, where it is shown as
a particular case of a more general theorem.

Consider a set of n random values of a variable from a normal population of distribution

$$p(x)\,dx = \frac{1}{\sigma\sqrt{(2\pi)}}\exp\left\{-\frac{(x-\xi)^2}{2\sigma^2}\right\}dx. \tag{1}$$

Of such a set let $x_p$ and $x_q$ denote the pth and qth values (p < q) in ascending order of
magnitude, and $x_r$ denote the remaining (n - 2) values such that

$$\left.\begin{aligned}
x_r &\le x_p, & r &= 1, 2, \dots, (p-1),\\
x_p \le x_r &\le x_q, & r &= (p+1), (p+2), \dots, (q-1),\\
x_r &\ge x_q, & r &= (q+1), (q+2), \dots, n.
\end{aligned}\right\} \tag{2}$$

Now a set of any n values may be arranged in n! ways and in random samples all
arrangements of the same n values are of equal probability. The group of (p - 1) values all
less than $x_p$ are not ranked in any particular order, and there are hence (p - 1)! ways in
which they may be arranged. Similarly, the group of values from $x_{p+1}$ to $x_{q-1}$ may be
arranged in (q - p - 1)! ways and the third group from $x_{q+1}$ to $x_n$ in (n - q)! ways. The
distribution of random samples in which the pth and qth values in ascending order are
denoted by $x_p$ and $x_q$, and the remaining values satisfy the conditions in (2), is therefore
given by

$$p(x_1,\dots,x_n)\,dx_1\dots dx_n = \left[\frac{n!}{(p-1)!\,(q-p-1)!\,(n-q)!}\right]\frac{1}{\{\sigma\sqrt{(2\pi)}\}^n}\exp\left\{-\frac{1}{2\sigma^2}\sum_{r=1}^{n}(x_r-\xi)^2\right\}dx_1\dots dx_n, \tag{3}$$

where the constant term in brackets makes the total frequency of all such samples equal to
unity.

The joint distribution of the sample mean ($\bar{x}$) of the n values and the difference
$\Delta = (x_q - x_p)$ is given by

$$p(\bar{x},\Delta)\,d\bar{x}\,d\Delta = \frac{n!}{(p-1)!\,(q-p-1)!\,(n-q)!}\,\frac{1}{\{\sigma\sqrt{(2\pi)}\}^n}\int\!\!\dots\!\!\int\exp\left\{-\frac{1}{2\sigma^2}\sum_{r=1}^{n}(x_r-\xi)^2\right\}dx_1\dots dx_n, \tag{4}$$

where the multiple integral is evaluated over the domain of the x's conditioned by the limits
indicated in (2) and by $\Delta = (x_q - x_p)$ and $\bar{x} = \frac{1}{n}\sum_{r=1}^{n}x_r$.

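Make the transformation to variables defined by

$$\left.\begin{aligned}
\bar{x} &= \frac{1}{n}(x_1 + x_2 + \dots + x_n),\\
y_1 &= -x_p + x_1,\\
&\ \,\vdots\\
y_{p-1} &= -x_p + x_{p-1},\\
y_{p+1} &= -x_p + x_{p+1},\\
&\ \,\vdots\\
y_n &= -x_p + x_n.
\end{aligned}\right\} \tag{5}$$

The Jacobian of the transformation is

$$\frac{\partial(x_1, x_2, \dots, x_n)}{\partial(\bar{x}, y_1, \dots, y_{p-1}, y_{p+1}, \dots, y_n)} = 1$$

and, from (2) and (5), the transformed limits of integration are

$$\left.\begin{aligned}
-\infty < y_r &\le 0, & r &= 1, 2, \dots, (p-1),\\
0 \le y_r &\le \Delta, & r &= (p+1), (p+2), \dots, (q-1),\\
\Delta \le y_r &< \infty, & r &= (q+1), (q+2), \dots, n,\\
y_q &= \Delta.
\end{aligned}\right\} \tag{6}$$

Using the relations in (5) above it may easily be shown that

$$\sum_{r=1}^{n}(x_r-\xi)^2 = n(\bar{x}-\xi)^2 + \frac{n-1}{n}\sum_{r=1}^{n}y_r^2 - \frac{2}{n}\sum_{s=1}^{n-1}\sum_{t=1}^{n-s}y_s\,y_{s+t}, \tag{7}$$

where in the summation on the right r, s, s + t ≠ p. With the new variables we have, from
(4), (5), (6) and (7), the joint distribution of $\bar{x}$ and $\Delta$ given by

$$p(\bar{x},\Delta)\,d\bar{x}\,d\Delta = \left[\frac{\sqrt{n}}{\sigma\sqrt{(2\pi)}}\exp\left\{-\frac{n(\bar{x}-\xi)^2}{2\sigma^2}\right\}d\bar{x}\right]
\times\left[\frac{\sqrt{n}\,(n-1)!\;d\Delta}{(p-1)!\,(q-p-1)!\,(n-q)!\,(2\pi)^{\frac12(n-1)}\sigma^{n-1}}\int_{-\infty}^{0}\!\!\dots\int_{0}^{\Delta}\!\!\dots\int_{\Delta}^{\infty}\exp\left\{-\frac{1}{2\sigma^2}\left(\frac{n-1}{n}\sum_{r=1}^{n}y_r^2-\frac{2}{n}\sum_{s=1}^{n-1}\sum_{t=1}^{n-s}y_s\,y_{s+t}\right)\right\}\prod_{r\ne p,q}dy_r\right], \tag{8}$$

with the restriction that r, s, s + t ≠ p, and $\Delta$ is to be substituted for $y_q$.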

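The first bracket of (8) is the distribution of the sample mean $\bar{x}$. It follows
therefore that the second bracket is the distribution of $\Delta$ and, because this expression
does not involve $\bar{x}$, that $\Delta$ is independent of the magnitude of the sample mean.

Estimates of the population standard deviation which are linear functions of such
differences between the pth and qth values (the range, the semi-interquartile range and
similar measures of dispersion) are accordingly independent of the corresponding sample
mean. A simple extension of the argument shows that there is also statistical independence
between the mean of a number of groups and the corresponding mean range of the groups.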
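
The algebraic identity (7) and the conclusion drawn from (8) can be checked numerically. The sketch below is only an illustration under stated assumptions: the sample, the index p and the population mean are arbitrary choices, and a near-zero correlation between mean and range in simulated normal samples is consistent with, though of course does not prove, independence.

```python
import random


def identity_sides(sample, p, xi):
    """Left and right sides of identity (7); p is the 1-based rank of x_p."""
    x = sorted(sample)
    n = len(x)
    x_bar = sum(x) / n
    # y_r = x_r - x_p for every r except r = p.
    y = [x[r] - x[p - 1] for r in range(n) if r != p - 1]
    lhs = sum((v - xi) ** 2 for v in x)
    cross = sum(y[s] * y[t] for s in range(len(y)) for t in range(s + 1, len(y)))
    rhs = (n * (x_bar - xi) ** 2
           + (n - 1) / n * sum(v * v for v in y)
           - 2 / n * cross)
    return lhs, rhs


def mean_range_correlation(n, trials, seed):
    """Sample correlation between mean and range over simulated normal samples."""
    rng = random.Random(seed)
    means, ranges = [], []
    for _ in range(trials):
        s = [rng.gauss(0.0, 1.0) for _ in range(n)]
        means.append(sum(s) / n)
        ranges.append(max(s) - min(s))
    m_bar = sum(means) / trials
    w_bar = sum(ranges) / trials
    cov = sum((m - m_bar) * (w - w_bar) for m, w in zip(means, ranges))
    var_m = sum((m - m_bar) ** 2 for m in means)
    var_w = sum((w - w_bar) ** 2 for w in ranges)
    return cov / (var_m * var_w) ** 0.5


lhs, rhs = identity_sides([2.1, -0.4, 3.3, 0.9, 1.7], p=2, xi=0.5)
corr = mean_range_correlation(n=5, trials=20000, seed=1)
print(f"identity (7): {lhs:.6f} vs {rhs:.6f}; corr(mean, range) = {corr:.3f}")
```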
Table 3. 10 % points of u = u(m, n)


10 16 


1-65+ 1-66 


1 -66+ D66-^ 1*65 


1*66 1*65+1 1*66+ 


Table 4. 5 % points of u = u(m, n)


2 

10*14 

3*87 

2*98 

3 

3-82 

2*64 

2*37 

4 

2*95+ 

2*37 

2*22 

5 

2*63 

2*25+ 

2*15- 

G 

2-48 

2*19 

2*11 

7 

2*38 

2*15+ 

2*09 

8 

2*32 

2*13 

2*07 

9 

2*27 

2*11 

2*06 

10 

2*24 

2*09 

2*06'^ 


6 

2 * 

9 

2 * 

6 

2 

3 

2 

1 

2 

)1 

2 

)0 

1 

>0 , 

1 * 

)9 

1 

)9 

1 

)8 

1 

)8 

1 

)8 

1 

)8 

1 


Note. The numbers in brackets in the column headed m = 60 indicate the number of units which must
be subtracted in the second decimal place to obtain the level for m = 120 and the same value of n. Where
no figure is given u(120, n) = u(60, n) to second decimal place accuracy. E.g. for the 5 % level,

u(120, 2) = 1.98.


Table 5. 2 % points of u = u(m, n)


n \ 

1 

2 

3 

4 

5 

6 

8 

■ 

10 

16 

20 

30 

60 

2 

25*39 

6-27 

4*27 


3*27 

■Ml 

2*86 

2-73 

2-59 

2*52 

2-46+ 

2*39(3) 

3 

6*19 

3*66 

3*05- 

2*84 

2*72 


2*66 

2*51 

2*46-* 

2*42 

2*39 

2*36(2) 

4 

4*21 

3-05- 

2*77 

2*65- 

2*58 

2*63 

2*48 

2-45- 

2*41 

2*39 

2-37 

2*36-(1) 

5 

3-56 

2*84 

2-66- 

2*66 

2*61 

2*48 

2*44 

2*42 

2*39 

2*37 

2*36 

2*34(1) 

6 

3‘26’- 

2*73 

2-68 

2*51 

2*47 

2-45- 

2*42 


2*37 

2*36 

2*35+ 

2-34(1) 

7 


2*66 

2-54 

2-48 

2*46'^ 

2-43 

1 2*40 

2*39 

2*37 

2*36 

1 2*35- 

2*34(1) 

8 

2-96+ 

2*61 i 

2-61 

2*46 

2*43 

2-42 

1 2*39 

2*38 

2*36 

2*35+ 

2*34 

2*34(1) 

9 

2-87 

2*68 

2*49 

2*46"- 

2*42 

2*41 


2*37 

^•36 

2*36+ 

2*34 

2*33 

10 1 

2-81 

2*65'^ 

2*47 1 

2*44 

2*41 1 


2-38 

2*37 

2*36+ 

2*36- i 

2*34 

2*33 

12 

2'72 

2*61 

2*46- 

2*42 


2*39 

2*37 

2*36 

2*36+ 

2*34 

2*34 

2*33 

14 

2-87 

2*49 

2*43 

2*41 

2-39 

2-38 

2*37 

2*36 

2*36' 

2*34 

2*34 

2*33 

10 

2-63 

2*47 

2*42 

2*40 

2-38 

2*37 

2*36 

1 2*35*^ 



2*34 

2*33 

18 


2*46 

2*41 


2*38 

2-37 


2*36+ 



2*33 

2*33 

20 

2-58 

2*45- 

2*41 

2-39 

2-37 

2*37 

2*36 

2-35-- 

2*34 

2*34 

2*33 

2*33 


Table 6. 1 % points of u = u(m, n)


1 


1 


4 

6 

6 

8 

B 


20 

30 

60 

2 , 

60*79 

8*93 

6*49 

4*43 

3-93 

3*64 

3*32 

3*14 

2*93 

2-84 

2-75- 

2*66(4) 

3 

8-82 

4-34 

8*60 

3*30 

3*14 

3-03 

2*91 

2*84 

2-75- 

2*70 

-2*66 

2-62 (2) 

4 

6-42 

3.60 

3.20 

3*02 

2*92 

2*86 

2*79 

2*74 

2-68 

2*66 

2*63 

2*60(1) 

5 

4-38 

3*29 

3*02 

2*90 

2*83 

2*79 

2-73 

2*70 

2*66 

2*64 

2*62 

2-60(1) 

6 

3-90 

313 

2*93 

2*83 

2*78 

2*74 

2-70 

2*67 

2-64 

2*62 

2-61 

2-59(1) 

7 

3-63 

3.03 

2-87 

2*79 ' 

2*75- 

2*72 

2*68 

2*66 

2*63 

2*62 

2.00 

2*59(1) 

8 

3-45+ 

2.97 

2*83 

2*76 

2*72 

2*70 

2*67 ' 

2*66- 

2*62 

2*61 

2.60 

2*59(1) 

9 

3-33 

2*92 

2.80 

2-74 

2*71 

2*68 

2*66 

2.64 

2*62 

2*61 

2-60 

2*69(1) 

10 

3-24 

2-88 

2*78 

2*72 

2*69 

2*67 

2*66- 

2*63 

2*61 

2*60 

2.69 

2*69(1) 

12 

3-12 

2-83 

2-74 

2*70 

2*68 

2*66 

2*64 

2*62 

2*61 

2*60 

2-69 

2-58 

14 

305- 

2.80 

2*72 

2*69 

2*66 

2*66- 

2*63 

2*62 

2*60 

2*60 

2-59 

2*68 

16 

2-99 

2-78 

2*71 

2*67 

2*65+ 

2*64 

2*62 

' 2*61 

2*60 

2*60 

2-69 

2*58 

18 

2.96- 

2*76 

2*70 

2*66 

2*66- 

2*63 

2*62 

2*61 

2*60 

2*69 

2-59 

2*68 

20 

2-92 

2-74 

2*69 

2*66 

2*64 

2*63 

2*62 

2*61 

2-60 

2*59 

2-69 

2*68 « 


Note. The numbers in brackets in the column headed m = 60 indicate the number of units which must
be subtracted in the second decimal place to obtain the level for m = 120 and the same value of n. Where
no figure is given u(120, n) = u(60, n) to second decimal place accuracy. E.g. for the 2 % level
u(120, 6) = 2.33.

Table 7. 0.2 % points of u = u(m, n)


\ m 

n \ 

1 

2 

3 

4 

5 

6 

8 

: 

10 

15 

20 

30 

2 


20*1 

9-4 

6-8 

6*7 

5-1 

4*6- 

4*1 

3-7 

3*6 

3-4(2) 

3 

19‘8 

6*8 

6-1 

4-6 

4-2 


3*7 

3-6 

3*4 

3*3 

3*2 

4 

9-4 

61 

4*2 

3*9 

3-7 

3*6 

3*6- 

3-4 

3*3 

3*2 

3-2(1) 

5 

6-8 

4-4 

3-9 

3-7 

3*6+ 

3*6- 

3*4 

3*3 

3*2 

3'2 

3-2(1) 

6 

6-7 

4-1 ' 

3*7 

3*5+ 

3*4 ^ 

3*4 

3*3 

3*3 

3*2 

3*2 

3*1 

7 

6*1 

3-9 

3*6 

3-6-" 1 

3*4 

3*3 

3*3 

3*2 

3*2 

3*2 i 

3*1 

8 

4-7 

3*8 

3*6+ 

3*4 

3*3 

3*3 

3-2 

3*2 1 

3*2 

3*2 , 

3*1 

9 

4*6 

3-7 

3*5- 

3-4 

3*3 

3*3 

3*2 

3*2 

3*2 

3*1 

3*1 

MM 

4‘3 

3*6 

3*4 

3*3 

3*3 

3*3 

3*2 

3*2 

3*2 ' 

3*1 

3*1 

12 

4‘1 

3*6+ , 

3*4 


3*3 

3*2 

3*2 

3*2 

3*1 

3*1 

3*1 

16 

3-9 

3*6-* 

3*3 

3*3 

3*2 

3*2 

3*2 

3*2 

3*1 

3*1 

3*1 

■ 

3-7 

3-4 

3*3 

3-2 

3*2 

3*2 

3*2 

3*1 

3*1 

3*1 

3*1 


Table 8. 0.1 % points of u = u(m, n)


\ m 

n \ 

1 

2 

3 

4 

5 

6 

8 

10 

15 

20 

30 

2 

607*9 

28*5- 

11*7 

8-1 

6*7 

5*9 

6*0 

4*6 

4*1 

3*9 

3*7(2) 

3 

28-1 

8*7 


5*1 

4*6 

4*3 

4*0 

3*9 

3*7 

3*6 

3-6(1) 

4 

11*8 

5*8 

4*7 

4*3 

4*1 

3*9 

3*7 

3*6 

3*6+ 

3*5“ 

3-4(1) 

6 

8-2 

6*0 

4*3 


3*8 

3*7 

3*6 

3*6 

3*5“ 

3*4 

3-4(1) 

6 

6*7 

4*5+ 


3*8 

3*7 

3*6 

3*6+ 

1 

3*6“ 

1 3*4 

1 3*4 

3*4(1) 

7 

6*9 

4*3 

3*9 

3*7 

3*6 

3*6 

3*6+ 

3*6“ 

3*4 

3*4 

3*3 

8 

6*4 

4*1 i 

3*8 

3*7 

3*6 

3*6+ 

3*6“ 

3*4 

3*4 

3*4 

3*3 

9 

6*0 


3*8 

3*6 

3*6 

3*6+ 

3*6“ 

3*4 

3*4 

3*4 

3*3 

10 

4*8 


3*7 

3*6 

3*5+ 

3*5 

3-4 

3*4 

3*4 

3*4 

3*3 

12 

4*6+ 

3*8 

3*6 


3*6 

3*6- 

3*4 

3*4 

3*4 

3*3 

3*3 

16 

4*2 

3*7 

3*6 

3*5+ 

3*5- 

3*4 

3*4 

3*4 

3*3 

3*3 

3*3 

20 

4*0 

3*6 

3*6+ 

3*6- 

3*4 

3*4 

3*4 

3*4 

3*3 

3*3 

3-3 


Note. The numbers in brackets in the column headed m = 30 indicate the number of units which must
be subtracted in the first decimal place to obtain the level for m = 60 and the same value of n. Where
no figure is given u(60, n) = u(30, n) to first decimal place accuracy; u(120, n) = u(60, n) for (i) all 0.2 %
points except that u(120, 3) = 3.1 and (ii) all 0.1 % points except that u(120, 2) = 3.4 and u(120, 3) = 3.3.




























Table 9. Table for testing the significance of the deviation of the mean

The table gives values of the ratio (deviation of sample mean)/(range in sample) lying on
different levels of significance; the levels being the sum, α, of the two tails of the
probability distribution.

When considering deviations in the positive (or negative) direction only, the values of α at
the headings of the columns should be halved.

Table 11


     n      d_n       1/d_n      √n       d_n√n

     2    1.1284     0.8862    1.4142     1.5958
     3    1.6926     0.5908    1.7321     2.9316
     4    2.0588     0.4857    2.0000     4.1176
     5    2.3259     0.4299    2.2361     5.2009
     6    2.5344     0.3946    2.4495     6.2080
     7    2.7044     0.3698    2.6458     7.1551
     8    2.8472     0.3512    2.8284     8.0531
     9    2.9700     0.3367    3.0000     8.9101
    10    3.0775     0.3249    3.1623     9.7319
    11    3.1729     0.3152    3.3166    10.5232
    12    3.2585     0.3069    3.4641    11.2876
    13    3.3360     0.2998    3.6056    12.0281
    14    3.4068     0.2935    3.7417    12.7469
    15    3.4718     0.2880    3.8730    13.4463
    16    3.5320     0.2831    4.0000    14.1279
    17    3.5879     0.2787    4.1231    14.7932
    18    3.6401     0.2747    4.2426    15.4435
    19    3.6890     0.2711    4.3589    16.0798
    20    3.7350     0.2677    4.4721    16.7032
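
The d_n column of Table 11 is the mean range, in units of the population standard deviation, of samples of n from a normal population. A minimal sketch of how such values can be recomputed is given below; it assumes the standard integral E(w) = ∫ [1 - F(x)^n - (1 - F(x))^n] dx, with F the normal distribution function, and evaluates it by Simpson's rule. The integration limits and step count are arbitrary choices, not taken from the paper.

```python
import math


def normal_cdf(x):
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def mean_range(n, lo=-10.0, hi=10.0, steps=4000):
    """Mean range d_n of n normal values (unit s.d.), by Simpson's rule."""
    def integrand(x):
        f = normal_cdf(x)
        return 1.0 - f ** n - (1.0 - f) ** n

    h = (hi - lo) / steps          # steps must be even for Simpson's rule
    total = integrand(lo) + integrand(hi)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * integrand(lo + i * h)
    return total * h / 3.0


for n in (2, 6, 10, 20):
    d_n = mean_range(n)
    print(f"n = {n:2d}  d_n = {d_n:.4f}  1/d_n = {1 / d_n:.4f}  "
          f"d_n*sqrt(n) = {d_n * math.sqrt(n):.4f}")
```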
THE FREQUENCY DISTRIBUTION OF √b₁ FOR SAMPLES OF ALL
SIZES DRAWN AT RANDOM FROM A NORMAL POPULATION

By R. C. GEARY


1. Introductory

A research on which the writer has been engaged for some years has so far yielded the
following results:

(1) Testing for normality has a greater practical importance than statisticians (including
the writer) have been disposed to accord to it; actual probabilities may be seriously at
variance with probabilities derived from the well-known tables computed on the hypothesis
of universal normality; in consequence, testing for normality and, where necessary,
correction (even if rough and tentative) for suspected universal non-normality, should
become a part of statistical routine.

(2) For large samples, √b₁ and b₂ are the most efficient of large fields of tests of skewness
and kurtosis, respectively, amongst large fields of alternative universes.

These matters will be dealt with in detail in subsequent papers. It seems, in the first
instance, desirable to derive the frequency distribution of √b₁ for normal random samples
of all sizes, partly on account of the inherent importance of the problem, partly in order to
explore a computational technique which might be found effective in solving the analogous
but probably more difficult b₂ problem.

Towards the solution of the problem there are available the exact values of the first four
even moments (the odd moments are, of course, zero) of normal √b₁, the second, fourth and
sixth having been determined by R. A. Fisher (1930) and the eighth by Joseph Pepper
(1932). It may be useful here to set out the four moments. Taking


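$$\sqrt{b_1} = m_3/m_2^{3/2} = \frac{\dfrac{1}{n}\sum_{i=1}^{n}(x_i-\bar{x})^3}{\left\{\dfrac{1}{n}\sum_{i=1}^{n}(x_i-\bar{x})^2\right\}^{3/2}}, \tag{1.1}$$

where n is the sample number, we have

$$\left.\begin{aligned}
\mu_2 &= \frac{6(n-2)}{(n+1)(n+3)},\\[4pt]
\mu_4 &= \frac{108(n-2)(n^2+27n-70)}{(n+1)(n+3)(n+5)(n+7)(n+9)},\\[4pt]
\mu_6 &= \frac{3240(n-2)(n^4+84n^3+2695n^2-15168n+20020)}{(n+1)(n+3)(n+5)(n+7)(n+9)(n+11)(n+13)(n+15)},\\[4pt]
\mu_8 &= \frac{7.5.3^5.2^4\,(n-2)(n^6+171n^5+13893n^4+580401n^3-5131014n^2+14132268n-12932920)}{(n+1)(n+3)(n+5)\dots(n+17)(n+19)(n+21)}.
\end{aligned}\right\} \tag{1.2}$$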
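
The four moment formulae lend themselves to an exact arithmetic check. The sketch below evaluates them with rational arithmetic; the coefficients are those of (1.2) as read here, and the comparison values for n = 3 rest on the assumption that √b₁ is then distributed as cos 3φ/√2 with φ rectangular, so that the 2k-th moment is C(2k, k)/8^k. Function names are illustrative.

```python
from fractions import Fraction


def odd_product(n, last):
    """(n + 1)(n + 3)...(n + last), with `last` odd."""
    result = 1
    for j in range(1, last + 1, 2):
        result *= n + j
    return result


def mu2(n):
    return Fraction(6 * (n - 2), odd_product(n, 3))


def mu4(n):
    return Fraction(108 * (n - 2) * (n**2 + 27*n - 70), odd_product(n, 9))


def mu6(n):
    poly = n**4 + 84*n**3 + 2695*n**2 - 15168*n + 20020
    return Fraction(3240 * (n - 2) * poly, odd_product(n, 15))


def mu8(n):
    poly = (n**6 + 171*n**5 + 13893*n**4 + 580401*n**3
            - 5131014*n**2 + 14132268*n - 12932920)
    return Fraction(7 * 5 * 3**5 * 2**4 * (n - 2) * poly, odd_product(n, 21))


for name, f in (("mu2", mu2), ("mu4", mu4), ("mu6", mu6), ("mu8", mu8)):
    print(name, "n = 3:", f(3), " n = 4:", f"{float(f(4)):.6f}")
```

For n = 4 the four values can be set beside the 'Actual' column of the moment comparison given for samples of 4.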
E. S. Pearson (1931, 1936) derived empirically 0.05 and 0.01 probability points for certain
values of n ≥ 25 using a Pearson Type VII curve and earlier approximations by R. A. Fisher
(1929) of the second and fourth moments.

The method here used for the derivation of the frequency distribution of √b₁ is essentially an
elaboration of that which the author used (1935, 1936) for finding the frequency distribution of
the test of kurtosis a (the ratio of the mean deviation to the standard deviation of the numbers
sampled), which consisted in establishing a relation in integral form between the frequency
ordinate for n with the value for (n - 1) and thereby determining the ordinates to any
required degree of accuracy for the lower n's. At a certain stage the actual frequency is shown
to be very close to the value based on the Gram-Charlier curve for the same value of n; and
the assumption is made that the Gram-Charlier may be relied on for values of n greater
than the 'transition value'. In the present problem the known normal moments are utilized
as well at every stage. In the concluding section the status of the solution in the hierarchy
of 'precision' is discussed.

Since the frequency is symmetrical, attention is confined practically exclusively to the
positive sector.


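2. The general integral iteration

To distinguish the sample size by the notation let the value of $\sqrt{b_1}$ be indicated by $t_n$. Apply
a Helmert orthogonal transformation to the original observations $x_1, x_2, \dots, x_n$ so that

$$\left.\begin{aligned}
x'_1 &= (x_1 - x_2)/\sqrt{2},\\
x'_2 &= (x_1 + x_2 - 2x_3)/\sqrt{6},\\
&\ \,\vdots\\
x'_{n-1} &= \{x_1 + x_2 + \dots + x_{n-1} - (n-1)x_n\}/\sqrt{[n(n-1)]},\\
x'_n &= (x_1 + x_2 + \dots + x_n)/\sqrt{n} = \bar{x}\sqrt{n},
\end{aligned}\right\} \tag{2.1}$$

which, on inversion, gives

$$\left.\begin{aligned}
x_n - \bar{x} &= -\frac{(n-1)\,x'_{n-1}}{\sqrt{[n(n-1)]}},\\
x_{n-1} - \bar{x} &= -\frac{(n-2)\,x'_{n-2}}{\sqrt{[(n-1)(n-2)]}} + \frac{x'_{n-1}}{\sqrt{[n(n-1)]}},\\
&\ \,\vdots\\
x_2 - \bar{x} &= -\frac{x'_1}{\sqrt2} + \frac{x'_2}{\sqrt6} + \dots + \frac{x'_{n-1}}{\sqrt{[n(n-1)]}},\\
x_1 - \bar{x} &= \frac{x'_1}{\sqrt2} + \frac{x'_2}{\sqrt6} + \dots + \frac{x'_{n-1}}{\sqrt{[n(n-1)]}}.
\end{aligned}\right\} \tag{2.2}$$

Then

$$\sum_{i=1}^{n}(x_i-\bar{x})^2 = \sum_{i=1}^{n-1}x_i'^2, \tag{2.3}$$

$$\begin{aligned}
\sum_{i=1}^{n}(x_i-\bar{x})^3 ={}& 3x_1'^2\left(\frac{x'_2}{\sqrt6}+\frac{x'_3}{\sqrt{12}}+\dots+\frac{x'_{n-1}}{\sqrt{[(n-1)n]}}\right) + 3x_2'^2\left(\frac{x'_3}{\sqrt{12}}+\dots+\frac{x'_{n-1}}{\sqrt{[(n-1)n]}}\right)\\
&+ \dots + \frac{3x_{n-2}'^2\,x'_{n-1}}{\sqrt{[(n-1)n]}} - \frac{x_2'^3}{\sqrt6} - \frac{2x_3'^3}{\sqrt{12}} - \frac{3x_4'^3}{\sqrt{20}} - \dots - \frac{(n-2)\,x_{n-1}'^3}{\sqrt{[(n-1)n]}}.
\end{aligned} \tag{2.4}$$

Apply a polar transformation to the $x'_1, x'_2, \dots, x'_{n-1}$, that is,

$$\left.\begin{aligned}
x'_1 &= r\sin\phi_{n-3}\sin\phi_{n-4}\dots\sin\phi_1\sin\phi_0,\\
x'_2 &= r\sin\phi_{n-3}\sin\phi_{n-4}\dots\sin\phi_1\cos\phi_0,\\
x'_3 &= r\sin\phi_{n-3}\sin\phi_{n-4}\dots\sin\phi_2\cos\phi_1,\\
x'_4 &= r\sin\phi_{n-3}\sin\phi_{n-4}\dots\sin\phi_3\cos\phi_2,\\
&\ \,\vdots\\
x'_{n-3} &= r\sin\phi_{n-3}\sin\phi_{n-4}\cos\phi_{n-5},\\
x'_{n-2} &= r\sin\phi_{n-3}\cos\phi_{n-4},\\
x'_{n-1} &= r\cos\phi_{n-3},
\end{aligned}\right\} \tag{2.5}$$

and

$$\sum_{i=1}^{n}(x_i-\bar{x})^2 = r^2, \tag{2.6}$$

$$\begin{aligned}
t_n = n^{\frac12}\Big\{&\frac{3}{\sqrt6}\sin^3\phi_{n-3}\sin^3\phi_{n-4}\dots\sin^3\phi_1\sin^2\phi_0\cos\phi_0 + \frac{3}{\sqrt{12}}\sin^3\phi_{n-3}\dots\sin^3\phi_2\sin^2\phi_1\cos\phi_1 + \dots\\
&+ \frac{3}{\sqrt{[(n-2)(n-1)]}}\sin^3\phi_{n-3}\sin^2\phi_{n-4}\cos\phi_{n-4} + \frac{3}{\sqrt{[(n-1)n]}}\sin^2\phi_{n-3}\cos\phi_{n-3}\\
&- \frac{1}{\sqrt6}\sin^3\phi_{n-3}\sin^3\phi_{n-4}\dots\sin^3\phi_1\cos^3\phi_0 - \frac{2}{\sqrt{12}}\sin^3\phi_{n-3}\dots\sin^3\phi_2\cos^3\phi_1 - \dots - \frac{n-2}{\sqrt{[(n-1)n]}}\cos^3\phi_{n-3}\Big\},
\end{aligned} \tag{2.7}$$

whence the fundamental iteration

$$t_n = \left(\frac{n}{n-1}\right)^{\frac12}t_{n-1}\sin^3\phi_{n-3} + \frac{3}{(n-1)^{\frac12}}\cos\phi_{n-3} - \frac{n+1}{(n-1)^{\frac12}}\cos^3\phi_{n-3}, \tag{2.8}$$

in which there intervenes only the angle $\phi_{n-3}$; and for normal random samples it is a
well-known property that r and the angles are independent of one another, the distribution of
$\phi_{n-3}$ being

$$C\sin^{n-3}\phi_{n-3}\,d\phi_{n-3}, \qquad C = \frac{\{\tfrac12(n-3)\}!}{\sqrt{\pi}\,\{\tfrac12(n-4)\}!}. \tag{2.9}$$

Now $t_{n-1}$ involves only $\phi_0, \phi_1, \dots, \phi_{n-4}$; hence it is independent of $\phi_{n-3}$. Accordingly, if the
frequency distribution of $t_{n-1}$ is of the form

$$f_{n-1}(t_{n-1})\,dt_{n-1}, \tag{2.10}$$

the joint distribution of $t_{n-1}$ and $\phi_{n-3}$ is given by

$$C\sin^{n-3}\phi_{n-3}\,f_{n-1}(t_{n-1})\,dt_{n-1}\,d\phi_{n-3}. \tag{2.11}$$

Now, from (2.8),

$$dt_{n-1} = \left(\frac{n-1}{n}\right)^{\frac12}\frac{dt_n}{\sin^3\phi_{n-3}}. \tag{2.12}$$

On substituting in (2.11) and integrating we find for frequency of $t_n$ the expression

$$f_n(t_n) = \left(\frac{n-1}{n}\right)^{\frac12}\frac{\{\tfrac12(n-3)\}!}{\sqrt{\pi}\,\{\tfrac12(n-4)\}!}\int\sin^{n-6}\phi_{n-3}\,f_{n-1}(t_{n-1})\,d\phi_{n-3}, \tag{2.13}$$

where the relation (2.8) obtains. Integration extends to values of $\phi_{n-3}$ (so that $0 \le \phi_{n-3} \le \pi$
for n > 3) which yield non-zero values of $f_{n-1}(t_{n-1})$. Setting $\cos\phi_{n-3} = x$ the integral at (2.13)
assumes the form

$$f_n(t_n) = \left(\frac{n-1}{n}\right)^{\frac12}\frac{\{\tfrac12(n-3)\}!}{\sqrt{\pi}\,\{\tfrac12(n-4)\}!}\int dx\,(1-x^2)^{\frac12(n-7)}f_{n-1}(t_{n-1}), \tag{2.14}$$

with, from (2.8),

$$t_{n-1} = \frac{(n-1)^{\frac12}\,t_n - 3x + (n+1)x^3}{n^{\frac12}(1-x^2)^{\frac32}}. \tag{2.15}$$

In the derivation of the frequencies for n = 4 to 8 inclusive, dealt with in later sections, both
the forms (2.13) and (2.14) are used.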
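
The iteration connecting the value of √b₁ for n observations with that for the first n - 1 can be checked on any set of numbers. The sketch below assumes the Helmert components are formed as x'_k = (x_1 + ... + x_k - k x_{k+1})/√[k(k+1)] and that the iteration takes the form t_n = √(n/(n-1)) t_{n-1} sin³φ + (3 cos φ - (n+1) cos³φ)/√(n-1), with cos φ = x'_{n-1}/r; the sample values and function names are arbitrary.

```python
import math


def sqrt_b1(x):
    """m3 / m2^(3/2) for the values in x."""
    n = len(x)
    mean = sum(x) / n
    m2 = sum((v - mean) ** 2 for v in x) / n
    m3 = sum((v - mean) ** 3 for v in x) / n
    return m3 / m2 ** 1.5


def helmert(x):
    """Helmert components x'_1, ..., x'_{n-1} of the observations."""
    return [(sum(x[:k]) - k * x[k]) / math.sqrt(k * (k + 1))
            for k in range(1, len(x))]


def iterate(t_prev, n, cos_phi):
    """One step of the iteration from t_{n-1} to t_n."""
    sin_phi = math.sqrt(1.0 - cos_phi ** 2)   # the angle lies between 0 and pi
    return (math.sqrt(n / (n - 1)) * t_prev * sin_phi ** 3
            + (3 * cos_phi - (n + 1) * cos_phi ** 3) / math.sqrt(n - 1))


sample = [1.0, 4.0, 2.5, 7.0, 3.0]
n = len(sample)
components = helmert(sample)
r_squared = sum(c * c for c in components)
cos_phi = components[-1] / math.sqrt(r_squared)

t_direct = sqrt_b1(sample)
t_iterated = iterate(sqrt_b1(sample[:-1]), n, cos_phi)
print(f"direct: {t_direct:.12f}  iterated: {t_iterated:.12f}")
```

The agreement of the two values, and of r² with the sum of squared deviations, reflects the fact that the first n - 2 Helmert components depend only on the first n - 1 observations.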


3. Functional discontinuities of the frequency

In the integral at (2.14) $t_n$ appears merely as a parameter. Consequently the nature of the
frequency $f_n(t_n)$ depends to a considerable extent on the simple algebraic properties of
$t_{n-1}(x)$ given by (2.15). The following property (easily demonstrated) is fundamental:

For $$t_n = (n-2k)/[k(n-k)]^{\frac12} = {}_k\tau_n \quad (k = 1, 2, \dots), \tag{3.1}$$
$t_{n-1}(x)$ has a maximum value of
$$(n-2k+1)/[(k-1)(n-k)]^{\frac12} = {}_{k-1}\tau_{n-1} \tag{3.2}$$
for $$x = -[(n-k)/k(n-1)]^{\frac12} \tag{3.3}$$
and a minimum value of
$$(n-2k-1)/[k(n-k-1)]^{\frac12} = {}_k\tau_{n-1} \tag{3.4}$$
for $$x = [k/(n-1)(n-k)]^{\frac12}. \tag{3.5}$$

Definition. ${}_k\tau_n$ are termed the link values or links of $t_n$. The regions between consecutive
links are termed zones. The graph of $t_{n-1}(x)$ for $-1 \le x \le +1$ and $t_n = {}_k\tau_n$ (given at (3.1))
is illustrated in Fig. 1. The limits of integration for integral (2.14) are now seen to be the
values of x at which the ordinates of the curve (2.15) in $(x, t_{n-1})$, with parameter $t_n$,
assume the limiting values of $t_{n-1}$. The scale on the right shows the links of $t_{n-1}$. The
curve traverses all the zones but has a 'turn' in the (k - 1)th zone, remaining entirely in the
zone the while. It is due to this turn that the phenomenon of functional discontinuity
manifests itself in the frequency $f_n(t_n)$.

Assume that within the hth zone the frequency $f_{n-1}(t_{n-1})$ is represented by ${}_hf_{n-1}(t_{n-1})$,
different in functional form for different values of h but the same (for example, having the
same coefficients in a power series) within each zone. It will at once be evident, from (2.14),
that the frequency of $t_n$ will have a like property. Now, from (2.4) and (2.5) it will be seen that

$$t_3 = -\frac{\cos 3\phi_0}{\sqrt2}; \tag{3.6}$$

the distribution of $\phi_0$ is rectangular, so that the distribution* of $t_3$ is given by

$$f_3(t_3) = \frac{1}{\pi(\tfrac12 - t_3^2)^{\frac12}} \tag{3.7}$$

and zero for $|t_3| > 1/\sqrt2$. It follows that $f_3(t_3)$ has a functional discontinuity at its links $\pm1/\sqrt2$.
Hence, by iteration, the frequency of $t_n$ is represented by different functional expressions in its
different interlink zones.


Fig. 1. Graph of $t_{n-1}(x)$, with the links of $t_{n-1}$ marked on the right-hand scale.


That the frequency has a finite limit (when n is finite) is established as follows. It can
easily be seen from (2.15) that when $t_n = {}_1\tau_n$ the curve $t_{n-1} = t_{n-1}(x)$ degenerates into (i) the
straight line x = -1 and (ii) a section above the straight line $t_{n-1} = {}_1\tau_{n-1}$ but touching it.
For $t_n > {}_1\tau_n$ no part of the curve $t_{n-1} = t_{n-1}(x)$ falls within the rectangle $x = \pm1$, $t_{n-1} = \pm\,{}_1\tau_{n-1}$.
Reference to (2.14) shows at once that, if $f_{n-1}(t_{n-1}) = 0$ for $|t_{n-1}| > {}_1\tau_{n-1}$, then $f_n(t_n) = 0$ for
$|t_n| > {}_1\tau_n$. But (3.7) shows that $t_3$ has as limiting values $\pm\,{}_1\tau_3$. Hence, by iteration, it
follows that the limiting values of the frequency of $t_n$ (or simpliciter of $t_n$) are

$$\pm\,{}_1\tau_n = \pm(n-2)/(n-1)^{\frac12}. \tag{3.8}$$

* R. A. Fisher (1930).
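
The limit of range (n - 2)/√(n - 1) can be illustrated without any integration. The sketch below rests on two assumptions that are not part of the derivation above: that a sample of n - 1 equal values and one outlying value attains the limit, and that simulated normal samples (an arbitrary seed and number of trials) give a fair picture of values that never exceed it.

```python
import math
import random


def sqrt_b1(x):
    n = len(x)
    mean = sum(x) / n
    m2 = sum((v - mean) ** 2 for v in x) / n
    m3 = sum((v - mean) ** 3 for v in x) / n
    return m3 / m2 ** 1.5


def limit_of_range(n):
    return (n - 2) / math.sqrt(n - 1)


def extreme_sample(n):
    """n - 1 coincident values and a single outlying one."""
    return [0.0] * (n - 1) + [1.0]


def largest_simulated(n, trials, seed):
    rng = random.Random(seed)
    return max(abs(sqrt_b1([rng.gauss(0.0, 1.0) for _ in range(n)]))
               for _ in range(trials))


for n in range(3, 9):
    print(f"n = {n}: limit = {limit_of_range(n):.6f}, "
          f"extreme sample = {sqrt_b1(extreme_sample(n)):.6f}, "
          f"largest of 5000 simulated = {largest_simulated(n, 5000, seed=n):.6f}")
```

For n = 3 the simulated maximum crowds against the limit, in keeping with the infinite ordinate of the frequency there; for larger n it falls increasingly short.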
As will presently appear, the frequencies for n = 4 and 5 have marked irregularities:
successive integration in accordance with (2.14) imparts, of course, a progressively increasing
degree of smoothness to the frequency. To give mathematical expression to this feature,
recourse is had to the idea of order of contact.

Definition. Two functions are said to have contact of order ${}_k\nu_n$ at link ${}_k\tau_n$ if the functions
and their first $({}_k\nu_n - 1)$ derivatives are finite and equal at the link. It can be shown without
difficulty that

$${}_k\nu_n = {}_{k-1}\nu_{n-1} + 1$$

when k > 1, n > 4. For what follows it will be convenient to set out for the smaller sample
numbers the values of the links and their orders of contact. The links for positive values
only of the variables are shown. The orders of contact will appear from a proposition
proved in § 5, giving the actual values of the frequencies near the limit of range. The
non-diminishing smoothness in the direction of the centre of the range will be noted.


Values of ${}_k\tau_n$ and ${}_k\nu_n$, n = 3 to 8 inclusive

            1st link          2nd link          3rd link          4th link
     n      τ       ν         τ        ν        τ         ν       τ      ν

     3     1/√2     0         -        -        -         -       -      -
     4     2/√3     0         0        0        -         -       -      -
     5     3/2      1        1/√6      1        -         -       -      -
     6     4/√5     1        1/√2      2        0         2       -      -
     7     5/√6     2        3/√10     2       1/(2√3)    3       -      -
     8     6/√7     2        2/√3      3       2/√15      3       0      4
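
The entries of the table can be regenerated mechanically. The sketch below assumes that the links are (n - 2k)/√[k(n - k)], works with their squares so as to stay in rational arithmetic, takes the orders of contact at the first link as data from the table, and assumes for the remaining links the recurrence "order at the k-th link for n = order at the (k-1)-th link for n - 1, plus one" (k > 1, n > 4), which is consistent with every tabulated entry. All names are illustrative.

```python
from fractions import Fraction

# Orders of contact at the first link, read from the table (n = 3 to 8).
FIRST_LINK_ORDER = {3: 0, 4: 0, 5: 1, 6: 1, 7: 2, 8: 2}


def link_squared(n, k):
    """Square of the k-th link, (n - 2k)^2 / (k (n - k))."""
    return Fraction((n - 2 * k) ** 2, k * (n - k))


def order_of_contact(n, k):
    if k == 1:
        return FIRST_LINK_ORDER[n]
    if n == 4:
        return 0            # tabulated directly; the recurrence needs n > 4
    return order_of_contact(n - 1, k - 1) + 1


for n in range(3, 9):
    row = [(link_squared(n, k), order_of_contact(n, k))
           for k in range(1, n // 2 + 1)]
    print(n, ", ".join(f"tau^2 = {t2}, nu = {nu}" for t2, nu in row))
```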


For even values of n the origin is always a link. In the determination of the frequencies
for n = 5 to 8, by the methods described in subsequent sections, the link ordinates and the
central ordinate play a cardinal role. In fact, the method will be seen to consist essentially
in finding curves which pass through the central and link ordinal points, have the required
orders of contact and the required form at the limit of range and have the exact earlier
momental values (see first section).

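4. The frequency near the centre of range

It will first be shown that

$$f_n'(+0) = 0 \quad\text{for } n > 4. \tag{4.1}$$

In fact, from (2.14) and (2.15), if $t_n = u$, a small positive quantity,

$$f_n(u) = a\int_{\lambda'}^{\lambda''}dx\,(1-x^2)^{\frac12(n-7)}f_{n-1}[t_{n-1}(x)], \qquad \lambda' = -\lambda-\kappa u,\quad \lambda'' = +\lambda-\kappa u,$$

a, λ, κ being positive constants. Hence

$$f_n'(u) = -a\kappa\left\{(1-\lambda''^2)^{\frac12(n-7)}f_{n-1}[t_{n-1}(\lambda'')] - (1-\lambda'^2)^{\frac12(n-7)}f_{n-1}[t_{n-1}(\lambda')]\right\} + a\int_{\lambda'}^{\lambda''}dx\,(1-x^2)^{\frac12(n-7)}\frac{\partial}{\partial u}f_{n-1}[t_{n-1}(x)].$$

Letting u → 0 the integral-free expression obviously vanishes provided that $f_{n-1}[t_{n-1}(\lambda)]$ is
finite, which it is when n > 4; and the integral becomes

$$a\left(\frac{n-1}{n}\right)^{\frac12}\int_{-\lambda}^{+\lambda}dx\,(1-x^2)^{\frac12(n-10)}f'_{n-1}[t_{n-1}(x)].$$

Since $f_{n-1}(y)$ is an even function of y, its derivative is odd, which remains an odd function
when y is replaced by an odd function of x. Hence the integral vanishes.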


5. The form of the frequency at the limit of range
In this section it will be shown that near $t_n = \pm(n-2)/(n-1)^{\frac12}$ the frequency is given by

1 ^(w-3)! 


Utn) = 


-4)U-1 


Un-i) 


3 - 4)! (3«,. n - 2)«’’ 

It may be seen at once that for w = 3 the frequency by (5-1) would be 


( 6 - 1 ) 




(5-2) 


as at (3.7). For n = 4, (5.1) gives √3/6, which is the value found by A. T. McKay (1933).
The general theorem will be proved by iteration. We assume a general form

in-3^ „ \ 

/«.-i(A-i) “ ~ A-i| > (5‘3) 

and show that a similar form emerges for $f_n(t_n)$, finding incidentally an iteration relation for
the constant $C_n$. First set

$$v = n - 2 - (n-1)^{\frac12}\,t_n$$

and assume that v is a positive quantity. It will readily appear, from (2.15), that, for v = 0,
the equation $t_{n-1}(x) = {}_1\tau_{n-1}$ has a double root at x = 1/(n - 1). Accordingly we set



a: = 

x' -f- 1 /( 71 -- 1) 

(6-4) 

and 

X = 

(71-3)2 

(71-2) 

(6-5) 

Having regard only to principal terms we find 



1 

(7l-l)«’ 

(6-6) 


X 2n-^{n -1 )3 (n - 2)-2 (71 - 3) ( 1 - a;''^) 17 I, 

(6-7) 

with 

x' — 

ilBrh'- 

(5-8) 

Now, 

from (2-14), 





iiyi|/^(i-^^)'<"-’)A-x(A-i). 



and, from the analysis in § 3, it will be clear that there are two separate parts of the domain D:

(I) a part near x = 1/(n - 1) for which $t_{n-1}$ is entirely in the first zone and $f_{n-1}$ by
hypothesis has the form (5.3);

(II) a part near x = -1 in which $t_{n-1}$ assumes all values.

* The symbol ≐ signifies 'equals, to required approximation'.
Let $$f_n(t_n) = f_{\mathrm{I}}(t_n) + f_{\mathrm{II}}(t_n), \tag{5.9}$$

where the functions on the right represent the contributions accruing from the respective
parts of the domain of integration. Then


fi(t 




X {n — 2)~^ {n — 3) {1 

which, on a change of variable from x to a:" by (6-4) and (5-8) and integrating in x", becomes 

(5-10) 


which is of the required form. As regards the contribution of II, in (2*15) set 

x-j-l ^v + ?/#. 


( 6 * 11 ) 


It will be found that when has its limiting value + {n — ^)j{n — 2y the vanishing of the 
terms in and gives 

/y = 1 71 , and V — - /- 

ti 3^ and i/-9^3^2(^_2)4> 

whereas if has its limiting value ^[n — 3)/(w “• 2)1 the values are 

2/2 (^-3) 


p = \n and y = 


Between the limits of x, 


9^j ^n^n — 2)^' 


-- 1 j-- 

* n— -L 


V 2/2 n — 3 


3n^{n-2)i ’ '' 

given by (2-15), assumes once all values between its limits of range and, in fact, 


(5-12) 


/u (^n.) = 




'3 n® ■ 

(5-13) 

J 

(6-14) 


the limits of integration in a; being given by (5-12). By (6-11) and (6-13) change the variable 
* into (via y) when (5-14) becomes 


/n (*«) - C'(w) _i) 


and the integral on the right is unity. Written in full (6*15) then becomes 
mif ^^ 1 i(rt-3)! _ 3 


(5-15) 


(6-16) 


There is no difficulty now in proving by iteration from (5.10) and (5.16) that the constant
$C_n$ has the form indicated in (5.1). Note that, in (5.9), $f_{\mathrm{I}}$ accounts for (n - 1)/n and
$f_{\mathrm{II}}$ for 1/n of the total frequency.
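6. Samples of 4

From (2.14), (2.15) and (3.7) the frequency for n = 4 is found to be

$$f_4(t_4) = \frac{\sqrt3}{2\pi}\int_D\frac{dx}{\sqrt{y}}, \tag{6.1}$$

where

$$y = 2(1-x^2)^3 - (v - 3x + 5x^3)^2, \quad\text{with } v = \sqrt3\,t_4. \tag{6.2}$$

D is the range of values of x which give non-negative values for y with |x| < 1. Now

$$y = -(3x^2-1)^2(3x^2-2) - 2v(5x^3-3x) - v^2, \tag{6.3}$$

from which it appears that when v is small y has two real roots near $-1/\sqrt3$, two imaginary
roots near* $+1/\sqrt3$ and single roots near $-\sqrt{\tfrac23}$ and $+\sqrt{\tfrac23}$, accounting thus for all six
roots. With terms of higher order in v neglected the four real roots are

$$-\sqrt{\tfrac23}-\frac{v}{9},\qquad -\frac{1}{\sqrt3}-\alpha v^{\frac12},\qquad -\frac{1}{\sqrt3}+\alpha v^{\frac12},\qquad \sqrt{\tfrac23}-\frac{v}{9},$$

with

$$\alpha^2 = \frac{2}{9\sqrt3}.$$

Hence the integral at (6.1) may be written as the sum of five integrals

$$\int_{-\sqrt{2/3}-v/9}^{-\sqrt{2/3}+v/9} + \int_{-\sqrt{2/3}+v/9}^{-1/\sqrt3-\alpha v^{1/2}} + \int_{-1/\sqrt3+\alpha v^{1/2}}^{1/\sqrt3-\alpha v^{1/2}} + \int_{1/\sqrt3-\alpha v^{1/2}}^{1/\sqrt3+\alpha v^{1/2}} + \int_{1/\sqrt3+\alpha v^{1/2}}^{\sqrt{2/3}-v/9}.$$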


Fig. 2 illustrates the division of the region of integration D. There are five divisions,
numbered I-V, in what follows, in which we regard as 'principal terms' only those in log v
and the constant term. Terms of order $v^{\frac12}$ will ultimately be ignored:


r-Vi+w/B 

I = =. 0{V^), 

J -Vi^-v/9 


II == 


-l/\/3-ar^ 
— V^+WU 


dx( — 1 -f (2 — 
1 


4^3 


‘(fr- 


—log 3a^t?|==^ 


* v/O 

J l/V-S + aoi 


=v. 


Ill 


IV 


-f 


+ l/V3-aDi 

J -l/V3-«ui 

l/V3-l-aD^ 


-jMl- 3»V (2 - - 5^3% [^) , 


/•l/V3-l-aDr ^ -fl 

^ 1/V3—avi^ V —1 


' 1/V3—avi^ 

Neglecting 0(?;i) we accordingly have 


^ 1)”"^ dx^ == ^ sinh~^ 1. 


I+II + III+IV + V=^ sinh”"^ 1 + 2(7, 

The constant 2(7 derives from additional terms in integrals II, III and V: 


(6-4) 


with 


™ == J _ 2 v{ 5 x^ - 3 a;) - 

7 = 1/^3 

* Tn. a sense which will be obvious from Fig. 2. 
We have already taken account of the term in III found when v is zero. The constant C
derives from the even powers of z in the formal expansion of the denominator of the integral
element (the odd powers vanish by symmetry). Setting

* = ^ 3 - 3 !', l-3a;2-2,/3a:', 

On expairsion of III, 

00 ei/va 
k^lj ctv^ 

where G^tc fhe even-order coefficients in the expansion of (1 -I- z)~-, i.e. t/aj. is the coefficient 
of z^'^. On integration we are interested only in the value at the lower limit av^, for all 
terras at the upper limit (and certain terms at the lower limit) are 0{v^) at least. Hence 



Fig. 2. Note. This diagram is designed merely to give a general idea of the limits of
integration. It is not drawn to any scale. Following are the values of the limits:


£1 = 4 

g' = V|--n/9 


£2 = 1/V3 
g' = l/V3 + anl 


•0:2 = 2/(9 V3) 


Similarly II + V also yield C, giving a constant additional term of 2C.

= ‘ f ‘ + 1 f+#)-'+(1 

,, 0 ^ 2y3J 0 ^ 


-i[- 


logx + ilog 


(1+ 31^-1 
(1 d-X®)* •+• 


I + llogj 


1 ~(] — 

+ (1-X2)i 


G 




{log2 + ilog (2i-l)}. 


x<=X 

x^O 


(6-5) 



' 7 g Frequency distribution of Jb^ 

All logs are to base e (unless otherwise indicated in what follows). Hence

$$f_4(t_4) = 0.372646 - \frac{1}{2\pi}\log v \qquad (6.6)$$

$$\qquad\; = 0.285222 - 0.366466\,\log_{10}|t_4|, \qquad (6.7)$$

since $v = \sqrt{3}\,|t_4|$.

A. T. McKay (1933), from a different approach, gave the log term in (6.7), and, as a rough
approximation to the constant term, the value 0.311668. He also showed that an expression
of the form (6.7) accounted for most of the frequency, a fact of great importance. Assume
that the residual term is of form

$$A\,|t_4|^{1/2} + B\,|t_4|$$

and find A and B from

(i) $f_4(2/\sqrt{3}) = 1/(2\sqrt{3})$ (McKay (1933)),

(ii) total frequency is unity,

giving

$$f_4(t_4) = 0.285222 - 0.366466\,\log_{10}|t_4| - 0.009178\,|t_4|^{1/2} + 0.031359\,|t_4|. \qquad (6.8)$$

For algebraic manipulation at the next stage the form of residual $A'|t_4| + B't_4^2$ will be found
more convenient, however, with A' and B' also determined from (i) and (ii). In this form

$$f_4(t_4) = 0.285222 - 0.159156\,\log|t_4| + 0.014275\,|t_4| + 0.007398\,t_4^2. \qquad (6.9)$$

Note the smallness of the coefficients A, B, A' and B' in (6.8) and (6.9).

In the following table the first four even moments as derived from frequencies (6.8) and
(6.9) are compared with the actual values as derived from the formulae (1.2). Both formulae
yield excellent approximations, with (6.8) always superior to (6.9) however. Either formula
can obviously be used with complete confidence for deriving the probability points. The
frequency graph in Fig. 4 is derived from (6.8), which should also be used for the computation
of the probability points.

Moment      Actual       Formula (6.8)   Formula (6.9)

μ2          0.342857     0.342930        0.342470

μ4          0.258941     0.258979        0.258606

μ6          0.240603     0.240263        0.240205

μ8          0.245940     0.245949        0.245662
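The two conditions used to fix the residual coefficients, and the moment comparison in the table, can be checked numerically. The sketch below is only an illustration under stated assumptions: the residual of (6.9) is taken as A'|t| + B't², the limit of range of t_4 as 2/√3, and the logarithm as natural; these readings were recovered from a damaged scan, and the names `f4` and `even_moment` are hypothetical.

```python
import math

# Assumed readings of (6.9): constant, log coefficient, residual A'|t| + B' t^2.
R = 2 / math.sqrt(3)            # assumed limit of range of t_4
C0, K = 0.285222, 0.159156
A1, B1 = 0.014275, 0.007398


def f4(t):
    """Ordinate of the approximate frequency (6.9) for 0 < |t| <= R."""
    t = abs(t)
    return C0 - K * math.log(t) + A1 * t + B1 * t * t


def even_moment(m):
    """2 * integral from 0 to R of t^m f4(t) dt, term by term in closed form."""
    p = m + 1
    const = C0 * R**p / p
    # integral of t^m log t over (0, R) equals R^p (log R / p - 1 / p^2)
    log_term = -K * R**p * (math.log(R) / p - 1 / p**2)
    resid = A1 * R**(p + 1) / (p + 1) + B1 * R**(p + 2) / (p + 2)
    return 2 * (const + log_term + resid)


total = even_moment(0)
moments = {m: even_moment(m) for m in (2, 4, 6, 8)}
print(f"total frequency = {total:.6f}")
for m, value in moments.items():
    print(f"mu_{m} from (6.9) = {value:.6f}")
print(f"f4 at the limit = {f4(R):.6f}, 1/(2*sqrt(3)) = {1 / (2 * math.sqrt(3)):.6f}")
```

Under these assumptions the total frequency comes out very close to unity and the even moments agree with the (6.9) column of the table, which is some evidence that the exponents were read correctly.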


7. Samples of 5

After many computational experiments the method used for determining the frequency
$f_5(t_5)$ was as follows:

(1) Using (2.14) with form (6.9) for $f_4(t_4)$, the central and link ordinates, i.e. $f_5(0)$ and $f_5(1/\sqrt{6})$,
were computed.

(2) The approximate value of $f_5(t_5)$ near $t_5 = 1/\sqrt{6} + 0$ was found in the form

$$f_5(1/\sqrt{6}) + M\,(t_5 - 1/\sqrt{6})^{1/2},$$

M being known.

(3) The two zonal curves were found (i) passing through $(0, f_5(0))$ and $(1/\sqrt{6}, f_5(1/\sqrt{6}))$
with $f_5'(0) = 0$ and (ii) passing through $(1/\sqrt{6}, f_5(1/\sqrt{6}))$ and with the required form at $1/\sqrt{6} + 0$
(i.e. as at (2) above) and at the limit $(\tfrac{3}{2} - 0)$, so that $\mu_0$ (= 1), $\mu_2$, $\mu_4$ and $\mu_6$ have the exact values
as given for n = 5 by the formulae (1.2).

Setting then 

Uh) = 0-286222-()-1591561og, j \ + (7*1) 


with 

and 

we have 


^ ^5 (1 —a:^)* ’ 

i?(y = 0-014275 I ti 1 +0-007398i|, 


(7-2) 

(7-3) 

(7-4) 


the limits of integration being A (negative) and (positive) which are the values of x, from 
(7-2), corresponding respectively to a/ 3 and ^4 = + ^3. We shall be concerned only 

with the case ^5 ^ 1 /^6 when t^{x) has three real roots /?, a and y of which is negative and a 
and y are positive. For (7*4) the following are required 


r dx ^, fi fj/i 
I"/'da: log (1 —x^ 


(7-5) 




= J{log^ (!+/<■)- log=^ (1 + A) - log’^ (1 - y«)+log^ (1 - A)} 
+ -^{log(H-/i) log(l-/i)-log(l + A)log(l-A)} 

+log21„6(^) + 7(i±#)-j(i±^' 


(7-6) 


f dx T 

J 


* 2+3 


= I log 






-ilog(^;^)log(l+a)(l+/9)(l + r) + ^(,|«+ (’•7) 


with 


Ky = (a-A)/(l-a), 
= (/i-a)/(l + a), 
== (7-A)/(l-r). 


<, = {/<-mi+^), 

AT, = (a-A)/{l + a), 


Ki = {jLi-y)l{l + y), Xg = (/<-a)/(l-a), 


^^9 = (r-'^)/(i+r)> 
'^10 = (/i-r)/(i-yK 


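and

$$I(\kappa) = \int_0^{\kappa} \frac{\log x}{1+x}\,dx = \log\kappa\,\log(1+\kappa) - \psi(\kappa),$$

$$J(\kappa) = \int_0^{\kappa} \frac{\log x}{1-x}\,dx = -\log\kappa\,\log(1-\kappa) - \phi(\kappa),$$

$$\phi(\kappa) = \frac{\kappa}{1^2} + \frac{\kappa^2}{2^2} + \frac{\kappa^3}{3^2} + \dots, \qquad
\psi(\kappa) = \frac{\kappa}{1^2} - \frac{\kappa^2}{2^2} + \frac{\kappa^3}{3^2} - \dots,$$

when $\kappa \le 1$.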




It is useful to note that $\phi(1) = 1.644934 = 2\psi(1)$. The functions $\phi(\kappa)$ and $\psi(\kappa)$ do not
appear to be tabulated. By fitting curves to their values for equally spaced intervals of 0.05
from 0 to 0.5 the following very close approximations are found, applicable for $x \le \tfrac{1}{2}$:

$$\phi(x) = 1.000567x + 0.233454x^2 + 0.186052x^3,$$

$$\psi(x) = 0.999835x - 0.244220x^2 + 0.077024x^3.$$

When $\tfrac{1}{2} \le \kappa \le 1$ the following formulae can be used:

$$\phi(\kappa) = \phi(1) - \log\kappa\,\log(1-\kappa) - \phi(1-\kappa),$$

$$\psi(\kappa) = \psi(1) + \log\kappa\,\log(1+\kappa) - \phi(1-\kappa) + \tfrac{1}{2}\phi(1-\kappa^2).$$

When $\kappa > 1$ we use

$$\phi(\kappa) = 2\phi(1) - \tfrac{1}{2}\log^2\kappa - \phi(1/\kappa),$$

$$\psi(\kappa) = 2\psi(1) + \tfrac{1}{2}\log^2\kappa - \psi(1/\kappa).$$

Another useful formula is

$$\phi(\kappa) = \psi(\kappa) + \tfrac{1}{2}\phi(\kappa^2).$$
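The series and reduction formulae for these two functions lend themselves to a short numerical check. The sketch below assumes that φ(κ) is the series κ/1² + κ²/2² + κ³/3² + ... (the dilogarithm) and that ψ(κ) is the same series with alternating signs; the function names and the 200-term truncation are choices made for this illustration only.

```python
import math

PHI1 = math.pi**2 / 6          # phi(1) = 1.644934...


def phi_series(k, terms=200):
    """Partial sum of k/1^2 + k^2/2^2 + ...; adequate for 0 <= k <= 1/2."""
    return sum(k**j / j**2 for j in range(1, terms + 1))


def phi(k):
    """phi(k) for k >= 0, using the reduction formulae outside [0, 1/2]."""
    if k <= 0.5:
        return phi_series(k)
    if k < 1:
        return PHI1 - math.log(k) * math.log(1 - k) - phi_series(1 - k)
    if k == 1:
        return PHI1
    return 2 * PHI1 - 0.5 * math.log(k) ** 2 - phi(1 / k)


def psi(k):
    """psi(k) obtained from phi(k) = psi(k) + phi(k^2) / 2."""
    return phi(k) - 0.5 * phi(k * k)


def phi_fit(x):
    """Cubic approximation quoted in the text for x <= 1/2."""
    return 1.000567 * x + 0.233454 * x**2 + 0.186052 * x**3


def psi_fit(x):
    return 0.999835 * x - 0.244220 * x**2 + 0.077024 * x**3


grid = [0.05 * i for i in range(11)]          # 0, 0.05, ..., 0.5
max_err_phi = max(abs(phi(x) - phi_fit(x)) for x in grid)
max_err_psi = max(abs(psi(x) - psi_fit(x)) for x in grid)
print(f"phi(1) = {phi(1):.6f}, 2*psi(1) = {2 * psi(1):.6f}")
print(f"largest fit error on the grid: phi {max_err_phi:.1e}, psi {max_err_psi:.1e}")
```

On this grid the cubic fits stay within about one unit in the third decimal place of the series values, which bears out the description of them as close approximations over the range 0 to ½.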

The algebra of the contribution to (7.4) from $B(t_4)$ is without mathematical interest.
From the formula the following were the values found for the central frequency and the
second link frequency:

$$f_5(0) = 0.606563; \qquad f_5(1/\sqrt{6}) = 0.599069. \qquad (7.8)$$

The moments, computed by (2.1), with n = 5, are

$$\mu_0 = 1,\quad \mu_2 = 0.375,\quad \mu_4 = 0.361607,\quad \mu_6 = 0.474609,\quad \mu_8 = 0.719382. \qquad (7.9)$$

Computation by approximate integration of certain of the ordinates gave evidence of
marked irregularity near the link $t_5 = 1/\sqrt{6}$. In consequence, it seemed desirable to try to
find a term (in addition to the constant given at (7.8)) of the expansion of $f_5(t_5)$ near $1/\sqrt{6} + 0$.

m i i * 


Setting 




(7-10) 


A<^ 


(7-11) 


( 7 - 12 ) 


where t is small and positive—we shall be interested only in a term in —we find 

The values v ± At^ are the abscissae of the points at which the curve ^ given by (7*2), 
intersects the link line ^4 == 2/^3 near x — v — It can easily be shown that 

V? 

12 * 

We are not concerned with the values (A-fA'i) and {11 +which are the abscissae corre¬ 
sponding to the intersection of = t^{x) with — — 2/,^3 and its third intersection with 
^4 == ^/\/3‘ Remembering that at the latter link has the value l|{2^J^), the integral- 
free term in in the first derivative offS^) is 

X - \At-^ +U'>^ + Ati) X - - 3 ) 

Also we have to consider the integral term in/ 5 («g). For this purpose, from ( 7 - 10 ) and ( 7 - 2 ), 


h 


6 




W t 








t (l-x'i)'*. 
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Remembering (7-1), it can be shown that the only term in (7'11) from which a term in 
can come is approximately 


_4_1 r/« dx 

TT ^5 27tJ a 1 - 



A and fi, the limiting values, being respectively negative and positive. 
Differentiating (7-14) in respect of t we find 


(7-14) 


U Al-a:H\ 


L 2/ 1 t 




77-^ ^6, 

Changing variables by 
and letting t tend towards + 0, we find for the term in 


ys 9 ) 


Adding (7-13) and {7-16) we find 


TrVd 5 ' j- 


n 


On integrating we find for the term in 




-u-i--* 


1 i 3 / 1 / 1 \i 

-^5-2* =-0.594117(*,-^^) , 


From (6-1) the value offg{x) near a; = f is 

0-219166(f-a:)^ 


(7.16) 


(7-16) 


(7*17) 


(7*18) 


where x is usually written for simplicity instead of $t_5$ in the remainder of this section.

Having regard to (7.8) and to the fact that, from § 4, $f_5'(0) = 0$, in the half-zone $(0, 1/\sqrt{6})$,
$f_5(x) = F(x)$ must be of form

$$F(x) = 0.606563 + a_2x^2 + a_3x^3 + a_4x^4. \qquad (7.19)$$

The first relation between the coefficients is found by giving expression to the fact that
y = F(x) passes through the link-point $(1/\sqrt{6}, 0.599069)$:

$$0.166667a_2 + 0.068041a_3 + 0.027778a_4 = -0.007494. \qquad (7.20)$$

In the zone $(1/\sqrt{6}, \tfrac{3}{2})$ assume that

$$f_5(x) = G(x) = -0.594117\,(x - 1/\sqrt{6})^{1/2} + 0.219166\,(\tfrac{3}{2} - x)^{1/2} + b_0 + b_1(\tfrac{3}{2} - x) + b_2(\tfrac{3}{2} - x)^2 + b_3(\tfrac{3}{2} - x)^3, \qquad (7.21)$$

designed to conform with requirements (7.17) and (7.18). Since y = G(x) must pass through
$(\tfrac{3}{2}, 0)$,

$$b_0 = 0.594117\,(\tfrac{3}{2} - 1/\sqrt{6})^{1/2} = 0.620776. \qquad (7.22)$$

Taking the value of $b_0$ into account and giving algebraic expression to y = G(x) passing
through $(1/\sqrt{6}, 0.599069)$, we find

$$1.091752b_1 + 1.191922b_2 + 1.301283b_3 = -0.250706. \qquad (7.23)$$
To find the six coefficients $a_2$, $a_3$, $a_4$ (in (7.19)) and $b_1$, $b_2$, $b_3$ (in (7.21)), we have, so far, found
two equations, (7.20) and (7.23). The remaining four equations are found by equating the
total frequency to unity and the first three even moments to their true values given at (7.9),
i.e. setting

$$\tfrac{1}{2}\mu_{2i} = \int_0^{1/\sqrt{6}} x^{2i}F(x)\,dx + \int_{1/\sqrt{6}}^{3/2} x^{2i}G(x)\,dx \qquad (i = 0, 1, 2, 3). \qquad (7.24)$$

On substituting for $a_4$ given by (7.20), for $b_0$ given by (7.22) and for $b_3$ given by (7.23), we
find the four equations in $a_2$, $a_3$, $b_1$, $b_2$:

0.os90721aa+0.0n39999a3 + 0.297981bi.+ 0.108441b2 = -0-071173; 
0.0364802fl2+0.0ni0237as +0-26833261+0.082590b2 = - 0-066293, | 

0 - 0 ^ 6901 a 2 + 0 -on 07176 a 3 + 0-303248bi+0-079086ba = -0-076019, 

0-056368aa +0-05ll69fls +0-400090bi+0-089674ba =-0-101300., 

On solution (and checking by substitution) the coefficients are found to give finally the
following frequencies:

Zone $f_5(x)$

$0$ to $1/\sqrt{6}$: $\;0.606563 - 0.3307x^2 + 3.1955x^3 - 6.1129x^4,$

$1/\sqrt{6}$ to $\tfrac{3}{2}$: $\;0.620776 - 0.594117\,(x - 1/\sqrt{6})^{1/2} + 0.219166\,(\tfrac{3}{2} - x)^{1/2} - 0.268273\,(\tfrac{3}{2} - x) + 0.067263\,(\tfrac{3}{2} - x)^2 - 0.029195\,(\tfrac{3}{2} - x)^3, \qquad (7.25)$

with $x = t_5$.

The extremely interesting form of the frequency curve may be observed from Fig. 5. In
the first half-zone the frequency shows but little variation: the curve declines to a minimum
of 0.6058 at x = 0.0894 then rises to a maximum of 0.6136 at x = 0.3027. It then recedes to
the link $1/\sqrt{6}$, where it assumes the value 0.5991. As one type of check on the reliability of
the results in general, some ordinates were computed directly (i.e. using (7.4) and (7.1)),
or by approximate integration using (7.4) and (6.8), and compared with the ordinates
computed from (7.25) to the following effect:


Trial value          Value of frequency
of t_5        By approx. integration    By (7.25)

0.15          0.6069                    0.6068

0.3           0.6106                    0.6136

0.6           0.3660                    0.3603

0.9           0.2232                    0.2308

1.2           0.1377                    0.1371


Except perhaps for the frequency at $t_5 = 0.9$, the correspondence is satisfactory; there can
be little doubt that the more accurate figures are those from (7.25).
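The two zonal curves for n = 5 can be evaluated directly to reproduce the tabulated ordinates and to confirm the conditions imposed in the fitting. The sketch below assumes half-power terms (x - 1/√6)^{1/2} and (3/2 - x)^{1/2} and a limit of range of 3/2, as recovered from the damaged scan; `f5`, `midpoint` and `even_moment` are hypothetical helper names, and the midpoint rule is a convenience of this illustration rather than the author's method.

```python
import math

LINK = 1 / math.sqrt(6)    # interior link for n = 5
LIMIT = 1.5                # assumed limit of range, 3/2


def f5(x):
    """Fitted frequency of t_5: polynomial inside the link, half-power terms outside."""
    x = abs(x)
    if x <= LINK:
        return 0.606563 - 0.3307 * x**2 + 3.1955 * x**3 - 6.1129 * x**4
    if x <= LIMIT:
        u = LIMIT - x
        return (0.620776 - 0.594117 * math.sqrt(x - LINK)
                + 0.219166 * math.sqrt(u)
                - 0.268273 * u + 0.067263 * u**2 - 0.029195 * u**3)
    return 0.0


def midpoint(g, a, b, n=20000):
    """Composite midpoint rule; never evaluates g at the end points."""
    h = (b - a) / n
    return h * sum(g(a + (i + 0.5) * h) for i in range(n))


def even_moment(m):
    """m-th moment of the symmetric frequency, zone by zone."""
    g = lambda x: x**m * f5(x)
    return 2 * (midpoint(g, 0.0, LINK) + midpoint(g, LINK, LIMIT))


for x in (0.15, 0.3, 0.6, 0.9, 1.2):
    print(f"f5({x}) = {f5(x):.4f}")
total, mu2, mu8 = even_moment(0), even_moment(2), even_moment(8)
print(f"total = {total:.5f}, mu2 = {mu2:.5f}, mu8 = {mu8:.4f}")
```

With these readings the ordinates at 0.15, 0.3 and 0.6 agree with the last column of the table to about four decimal places, and the total frequency and second moment return the imposed values of 1 and 0.375 closely.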

As a stringent test of the accuracy of the frequency the 8th moment $\mu_8$ was computed
from the empirical curves at (7.25) and compared with the actual value given at (7.9):

$\mu_8$ = 0.7191 (from the curves), 0.7194 (actual).




R. C. Geary 83

Even as the figures stand the check is decisive: it should be added that the 4th place of
decimals in the computed $\mu_8$ is suspect owing to the approximation used.


8. Samples of 6


In this case the links are 0, $1/\sqrt{2}$ and $4/\sqrt{5}$, and the link frequencies at the first two were
found by approximate integration using form (2.13) with $t_5$ given by (2.8). For this purpose,
drawings were made of the two sections of $f_5(t_5)$ on a scale sufficient to ensure that an
ordinate read for any abscissa would be correct probably to the 3rd place of decimals. For
intervals of 1°, values of $t_5$ were computed over the whole range by (2.8) (for $t_6$ given), and
graphically the value of $f_5(t_5)$ was read off for each. Hundreds of readings had to be made,
but actually the work, with a little practice, was rapid and accurate, the entries being
practically self-checking. The Gregory formula (using 2 correction terms) was used to give the
following results:

$$f_6(0) = 0.6889; \qquad f_6(1/\sqrt{2}) = 0.3247. \qquad (8.1)$$

The two zonal frequency curves, say y = F(x) in $(0, 1/\sqrt{2})$ and y = G(x) in $(1/\sqrt{2}, 4/\sqrt{5})$,
writing x instead of $t_6$, must have the following properties:

(i) $F(0) = 0.6889$,

(ii) $F'(0) = 0$ (§ 4),

(iii) $F(1/\sqrt{2}) = G(1/\sqrt{2}) = 0.3247$, $\qquad (8.2)$

(iv) $F'(1/\sqrt{2}) = G'(1/\sqrt{2})$ (§ 3),

(v) $G(x) \approx 5(4/\sqrt{5} - x)/36$ near the limit of range (from (5.1)).

The curves were taken to be of the forms

$$F(x) = 0.6889 + a_2x^2 + a_3x^3 + a_4x^4 + a_5x^5,$$
$$G(x) = \tfrac{5}{36}(\beta - x) + b_2(\beta - x)^2 + b_3(\beta - x)^3 + b_4(\beta - x)^4, \qquad \beta = 4/\sqrt{5}. \qquad (8.3)$$

The exact moments are

$$\mu_0 = 1,\quad \mu_2 = 0.380952,\quad \mu_4 = 0.409101,\quad \mu_6 = 0.642924,\quad \mu_8 = 1.219892. \qquad (8.4)$$

It is proposed to compute the seven coefficients in (8.3) using (8.2) and (8.4). Now, with
a curve of the type of $f_6(x)$, where much of the frequency is at the ends, it is evident that the
contribution from the zone $(0, 1/\sqrt{2})$ to the higher moments $\mu_6$ and $\mu_8$ is exceedingly minute:
this property was utilized to divide the single series of seven equations into two series of
three and four equations using the following device. Approximate F(x) by a curve $F_1(x)$
given by

$$F_1(x) = 0.6889 + a_2'x^2, \qquad (8.5)$$

finding $a_2'$ simply by passing $y = F_1(x)$ through $(1/\sqrt{2}, 0.3247)$, giving $a_2' = -0.7284$. If $\mu_{2s}$ is
the moment, let $\mu_{2s}'$ and $\mu_{2s}''$ be the contributions from F(x) and G(x) respectively, so that
$\mu_{2s} = \mu_{2s}' + \mu_{2s}''$, and let $\nu_{2s}'$ be the estimate, using $F_1(x)$, of $\mu_{2s}'$. For s = 2, 3, 4 the values are

$$\nu_4' = 0.030318, \qquad \nu_6' = 0.009360, \qquad \nu_8' = 0.003397,$$

which, subtracted from the corresponding $\mu_{2s}$ given by (8.4), give very close estimates of
$\mu_{2s}''$, which involve only $b_2$, $b_3$, $b_4$. The equations in order

(1) $1.170178b_2 + 1.265837b_3 + 1.369316b_4 = 0.174500$,

(2) $0.726980b_2 + 0.378949b_3 + 0.235502b_4 = 0.058771$, $\qquad (8.6)$

(3) $1.260067b_2 + 0.634702b_3 + 0.289663b_4 = 0.091217$,

are found from (iii) at (8.2), from $\mu_6''$ and from $\mu_8''$.




The equations in the a's are

(4) $0.5a_2 + 0.353553a_3 + 0.25a_4 + 0.176777a_5 = -0.3642$,

(5) $1.414214a_2 + 1.5a_3 + 1.414214a_4 + 1.25a_5 = -0.653179$, $\qquad (8.7)$

(6) $0.117851a_2 + 0.0625a_3 + 0.035355a_4 + 0.020833a_5 = -0.115790$,

(7) $0.035355a_2 + 0.020833a_3 + 0.012627a_4 + 0.007813a_5 = -0.031433$,

where (4) is from (iii) at (8.2), (5) from (iv) at (8.2), (6) from the total frequency = ½ and
(7) from variance = $\mu_2$.

The solutions of (8.6) and (8.7) yield the following frequencies:

Zone $f_6(x)$

$0$ to $1/\sqrt{2}$: $\;0.6889 - 1.2716x^2 - 2.6073x^3 + 9.5669x^4 - 6.7790x^5,$

$1/\sqrt{2}$ to $4/\sqrt{5}$: $\;\tfrac{5}{36}(\beta - x) + 0.047068(\beta - x)^2 + 0.024897(\beta - x)^3 + 0.064198(\beta - x)^4, \qquad (8.8)$

with $\beta = 4/\sqrt{5}$,

with $x = t_6$.

As a check, the 4th moment computed from the foregoing curves gave 0.4108 as compared
with the actual $\mu_4$ = 0.4092, an error of 0.38%. This is not of any importance from the
viewpoint of the computation of the probability points, but it illustrates how, using the integral
iteration method as generally in this paper, the momental check reveals increasing
discrepancies with increasing n.
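The momental check described here can be repeated by integrating the two fitted curves numerically. The sketch below assumes a leading coefficient of 5/36 in the outer zone (consistent with the quoted first derivative at the limit) and the signs of the inner-zone polynomial as read from the scan; the helper names are hypothetical.

```python
import math

LINK = 1 / math.sqrt(2)
BETA = 4 / math.sqrt(5)      # limit of range for n = 6
MU4_EXACT = 0.409101         # exact 4th moment quoted at (8.4)


def f6(x):
    """Fitted frequency of t_6 in its two zones (coefficients as read from (8.8))."""
    x = abs(x)
    if x <= LINK:
        return (0.6889 - 1.2716 * x**2 - 2.6073 * x**3
                + 9.5669 * x**4 - 6.7790 * x**5)
    if x <= BETA:
        u = BETA - x
        return (5 / 36) * u + 0.047068 * u**2 + 0.024897 * u**3 + 0.064198 * u**4
    return 0.0


def midpoint(g, a, b, n=20000):
    """Composite midpoint rule on (a, b)."""
    h = (b - a) / n
    return h * sum(g(a + (i + 0.5) * h) for i in range(n))


def even_moment(m):
    g = lambda x: x**m * f6(x)
    return 2 * (midpoint(g, 0.0, LINK) + midpoint(g, LINK, BETA))


total, mu2, mu4 = even_moment(0), even_moment(2), even_moment(4)
rel_err = (mu4 - MU4_EXACT) / MU4_EXACT
print(f"total = {total:.5f}, mu2 = {mu2:.5f}")
print(f"mu4 from curves = {mu4:.4f}, relative error = {100 * rel_err:.2f}%")
```

The 4th moment obtained this way is close to the 0.4108 quoted in the text, that is, roughly 0.4% above the exact value, while the total frequency and variance remain at their imposed values.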

It might be thought that by constructing empirically almost any symmetrical frequency
curve, so that say the 0th, 2nd and 4th moments have the true values, we shall ensure that
the subsequent even moments computed from such an empirical curve will approximate
closely to the corresponding true values. That this is not the case may be seen by computing
the 6th and 8th moments by the well-known Karl Pearson iteration formula,* where $\mu_2$
and $\mu_4$ have their true values, for with n = 6:



                 Actual      Karl Pearson     Percentage
                             iteration        discrepancy

μ6/μ2³           11.6291     12.4934          +10.7

μ8/μ2⁴           57.9214     73.3990          +20.7


Even when, in the Pearson iteration for $\mu_8/\mu_2^4$, one gives $\mu_6/\mu_2^3$ the correct value, we find a
percentage discrepancy of 17.9. These percentages place in perspective the minuteness of the
percentage errors found in using the higher momental check as it is used throughout this
paper.

It is an interesting question of general import whether in work of this kind the arduous
and potentially erroneous computation (by integral iteration) of the central and link
frequencies could be dispensed with, and reliance placed entirely on the moments, together
with the functional properties of the frequencies, which, of course, merely represent an
elaboration of the Karl Pearson approach. In this connexion a couple of experiments were
made on the frequency for n = 6.


* Tables for Statisticians and Biometricians, Part I, 2nd ed., p. xi.
For the first experiment, the two zonal curves were assumed to have the correct order
of contact, the correct form, (8.2) (v), at the limit of range and the correct values of
$\mu_0$ (= 1), $\mu_2$, $\mu_4$ and $\mu_6$. The equations are

Zone

$0$ to $1/\sqrt{2}$: $\;F_2(x) = 0.659844 - 1.075618x^2 + 0.555991x^3,$

$1/\sqrt{2}$ to $4/\sqrt{5}$: $\;G_2(x) = \tfrac{5}{36}(\beta - x) + 0.080560(\beta - x)^2 - 0.085469(\beta - x)^3 + 0.133119(\beta - x)^4$, with $\beta = 4/\sqrt{5}$. $\qquad (8.9)$

This gives a central frequency 0.6598 compared with the computed frequency (by (8.1)) of
0.6889. In all the circumstances the difference is not important. The 8th moment, $\mu_8$, from
(8.9), is 1.217706, or -0.18% in error.

The second experiment contemplated the frequency as a single-curve system with correct
first derivative ($-\tfrac{5}{36}$) at the limit and with correct $\mu_0$, $\mu_2$, $\mu_4$, $\mu_6$. The curve is

$$F_3(x) = 0.669426 - 1.51097x^2 + 1.53854x^3 - 0.60545x^4 + 0.085157x^5, \qquad (8.10)$$

which has the properties: (i) the central ordinate 0.6694 is close to the actual; (ii) the limit value
from the curve scarcely differed from the actual since $F_3(4/\sqrt{5}) = 0.0015$; (iii) $\mu_8$ from the curve
= 1.2237, an error of 0.31%.

All the systems (8.8), (8.9) or (8.10) yield probability points which differ very little. For
instance, in the three cases, the 5% point is given by



The practical identity of the latter two is due to the fact that the frequencies were derived
on very similar hypotheses: it does not mean that the result is more reliable than that from
(8.8) which, assuming the accuracy of the calculation of the link ordinates, must be deemed
to be the most correct and is adopted for the iteration to the n = 7 stage. Nevertheless, these
experiments convey the hint of general application that if we know (i) a number of moments,
(ii) the limits of range and the frequency form at the limits of range, and (iii) that the amount
of frequency near the limits of range is not negligible, we will probably be in a position to
estimate with fair accuracy the points of low probability. For this, however, hypothesis (iii)
is essential: it has no value from the computational point of view if the frequency near the
limits is negligible. This point is discussed further in § 10.

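9. Samples of 7

The functional properties of the curves at the n = 7 stage are as follows. Let the three links be
denoted by α, β, γ, so that

$$\alpha = 1/\sqrt{12}, \qquad \beta = 3/\sqrt{10}, \qquad \gamma = 5/\sqrt{6}. \qquad (9.1)$$

Denoting $t_7$ by x, set

$$y = x - \alpha, \qquad z = \gamma - x, \qquad (9.2)$$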




and let the curves in the half-zone $(0, 1/\sqrt{12})$, and in the zones $(1/\sqrt{12}, 3/\sqrt{10})$ and
$(3/\sqrt{10}, 5/\sqrt{6})$ be denoted respectively by F(x), G(y) and H(z). We then have

(i) $F(0) = 0.6781 = A$,

(ii) $F'(0) = 0$,

(iii) $F(\alpha) = G(0) = 0.5870 = B$,

(iv) $F'(\alpha) = G'(0)$,

(v) $F''(\alpha) = G''(0)$, $\qquad (9.3)$

(vi) $G(\beta - \alpha) = H(\gamma - \beta) = 0.1838 = C$,

(vii) $G'(\beta - \alpha) = -H'(\gamma - \beta)$,

(viii) $H(z) = Dz^{3/2} + c_2z^2 + c_3z^3$ with $D = 0.078091$.


The central and link ordinates A, B and C at (i), (iii) and (vi) were derived by the Gregory
formula from (2.14), using intervals of 0.01, 0.025 and 0.05 at different sections of the
integral range. The equalities in the derivatives at the links are in accordance with order of
contact requirements (§ 3). The first term on the right of (viii) is from (5.1) with n = 7.

Conditions (9.3) determine the form of the polynomials:

$$F(x) = A + a_2x^2 + a_4x^4,$$
$$G(y) = B + (2a_2\alpha + 4a_4\alpha^3)\,y + \tfrac{1}{2}(2a_2 + 12a_4\alpha^2)\,y^2 + b_3y^3 + b_4y^4, \qquad (9.4)$$
$$H(z) = Dz^{3/2} + c_2z^2 + c_3z^3,$$

with $x = t_7$.

F(x) is taken as an even function of x because it is symmetrical in the zone ($-1/\sqrt{12}$ to
$+1/\sqrt{12}$). This should have been done in the case of n = 6; neglect to do so was not serious
enough to render recalculation necessary.

The moments used were:

$$\mu_0 = 1,\quad \mu_2 = 0.375,\quad \mu_4 = 0.421875,\quad \mu_6 = 0.733487. \qquad (9.5)$$

Using (9.3) in conjunction with $\mu_0$ and $\mu_2$ (only) in (9.5) the following equations in the six
unknowns $a_2$, $a_4$, $b_3$, $b_4$, $c_2$, $c_3$ were found:





Eqn.   Left: coefficients of                                                          Right: absolute
no.    a_2         a_4         b_3         b_4         c_2         c_3                term

1      12          1           -           -           -           -                  -13.112496

2      0.816667    0.281315    0.287507    0.189756    -           -                  -0.403210

3      -           -           -           -           1.193686    1.304172           0.094626

4      1.897366    0.756233    1.306833    1.150028    2.185118    3.581055           -0.122438

5      0.229606    0.069278    0.047439    0.025048    0.434724    0.356221           -0.122152

6      0.130637    0.041871    0.032192    0.017836    0.668438    0.496634           -0.044366


Approximations to F, G and H were found:

$$F_1(x) = 0.6781 - 1.238888x^2 + 1.754160x^4,$$
$$G_1(y) = 0.5870 - 0.546478y - 0.361808y^2 + 0.171129y^3 + 0.347163y^4, \qquad (9.6)$$
$$H_1(z) = 0.078091z^{3/2} - 0.017319z^2 + 0.088408z^3.$$
These yielded estimates of the 4th and 6th moments as follows:

$$\mu_4' = 0.419712, \qquad \mu_6' = 0.720776, \qquad (9.7)$$

differing by -0.5% and -1.7% respectively from the correct values at (9.5). These
deviations were not serious from the viewpoint of probability-point determination. Nevertheless,
it seemed worth while to try to achieve a closer approximation. This was done by finding a
'corrector' $\phi(x)$ (not positive, like a frequency, for all values of x) with the following
properties:

(i) total 'frequency' zero,

(ii) '2nd moment' zero,

(iii) $\phi'(0) = 0$, $\qquad (9.8)$

(iv) $\phi(\gamma) = \phi'(\gamma) = 0$,

(v) '4th moment' $= \mu_4 - \mu_4' = 0.002163$.

Then

$$\phi(x) = 0.002404 - 0.028853x^2 + 0.045232x^3 - 0.024236x^4 + 0.004342x^5, \qquad (9.9)$$

and the frequencies finally adopted are

$0$ to $1/\sqrt{12}$: $\;F(x) = F_1(x) + \phi(x),$

$1/\sqrt{12}$ to $3/\sqrt{10}$: $\;G(y) = G_1(y) + \phi(x), \qquad (9.10)$

$3/\sqrt{10}$ to $5/\sqrt{6}$: $\;H(z) = H_1(z) + \phi(x),$

$F_1$, $G_1$ and $H_1$ being given by (9.6) and $x = t_7$. It is evident from the smallness of the
coefficients of $\phi(x)$ in (9.9) that the correction effected by $\phi(x)$ is minute. From (9.10) the moment
$\mu_6$ is 0.728972, so that the error is reduced to about one-third of what it was using $F_1$, $G_1$
and $H_1$.
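The five properties required of the corrector can be verified directly from its coefficients, since every integral of a polynomial is available in closed form. The sketch below assumes the powers x², x³, x⁴, x⁵ for the terms of (9.9) and the limit of range γ = 5/√6; the dictionary layout and function names are illustrative only.

```python
import math

GAMMA = 5 / math.sqrt(6)       # limit of range for n = 7
# power -> coefficient, as read from (9.9)
CORRECTOR = {0: 0.002404, 2: -0.028853, 3: 0.045232, 4: -0.024236, 5: 0.004342}


def corrector(x):
    """Value of the corrector at x >= 0."""
    return sum(c * x**p for p, c in CORRECTOR.items())


def corrector_slope(x):
    """First derivative of the corrector at x >= 0."""
    return sum(c * p * x**(p - 1) for p, c in CORRECTOR.items() if p > 0)


def corrector_moment(m):
    """2 * integral from 0 to GAMMA of x^m * corrector(x) dx (symmetric extension)."""
    return 2 * sum(c * GAMMA**(p + m + 1) / (p + m + 1) for p, c in CORRECTOR.items())


print(f"(i)   total 'frequency'  = {corrector_moment(0):+.6f}")
print(f"(ii)  '2nd moment'       = {corrector_moment(2):+.6f}")
print(f"(iii) slope at 0         = {corrector_slope(0.0):+.6f}")
print(f"(iv)  value, slope at gamma = {corrector(GAMMA):+.6f}, {corrector_slope(GAMMA):+.6f}")
print(f"(v)   '4th moment'       = {corrector_moment(4):+.6f}")
```

Under these assumptions properties (i) to (iv) hold to within rounding of the six-figure coefficients, and the '4th moment' comes out close to the required 0.002163.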

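10. Samples of 8

The links and link frequencies are as follows:

Link                 Link frequency

$0$                  : $0.6927 = A$,

$\beta = 2/\sqrt{15}$   : $0.4442 = B$,

$\gamma = 2/\sqrt{3}$   : $0.1018 = C$, $\qquad (10.1)$

$\delta = 6/\sqrt{7}$   : $0.04019153(\delta - x)^2 = D(\delta - x)^2/2$,

where $x = t_8$.

Set

$y = x - \beta$,

$z = \delta - x$,

$\kappa = \gamma - \beta = 0.638303$,

$\lambda = \delta - \gamma = 1.113086$.

The orders of contact (§ 3) entail the following forms for the three zones:

Zone

$0$ to $2/\sqrt{15}$: $\;F(x) = A + a_2\dfrac{x^2}{2} + a_3\dfrac{x^3}{6} + a_4\dfrac{x^4}{24},$

$2/\sqrt{15}$ to $2/\sqrt{3}$: $\;G(y) = B + \left(a_2\beta + a_3\dfrac{\beta^2}{2} + a_4\dfrac{\beta^3}{6}\right)y + \left(a_2 + a_3\beta + a_4\dfrac{\beta^2}{2}\right)\dfrac{y^2}{2} + b_3\dfrac{y^3}{6} + b_4\dfrac{y^4}{24}, \qquad (10.2)$

$2/\sqrt{3}$ to $6/\sqrt{7}$: $\;H(z) = D\dfrac{z^2}{2} + c_3\dfrac{z^3}{6} + c_4\dfrac{z^4}{24}.$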





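Five of the seven equations required to determine the a, b and c will be found from the
order of contact conditions, as follows:

(i) $B = A + a_2\dfrac{\beta^2}{2} + a_3\dfrac{\beta^3}{6} + a_4\dfrac{\beta^4}{24},$

(ii) $C = B + \left(a_2\beta + a_3\dfrac{\beta^2}{2} + a_4\dfrac{\beta^3}{6}\right)\kappa + \left(a_2 + a_3\beta + a_4\dfrac{\beta^2}{2}\right)\dfrac{\kappa^2}{2} + b_3\dfrac{\kappa^3}{6} + b_4\dfrac{\kappa^4}{24},$

(iii) $C = D\dfrac{\lambda^2}{2} + c_3\dfrac{\lambda^3}{6} + c_4\dfrac{\lambda^4}{24},$

(iv) $a_2\beta + a_3\dfrac{\beta^2}{2} + a_4\dfrac{\beta^3}{6} + \left(a_2 + a_3\beta + a_4\dfrac{\beta^2}{2}\right)\kappa + b_3\dfrac{\kappa^2}{2} + b_4\dfrac{\kappa^3}{6} = -\left(D\lambda + c_3\dfrac{\lambda^2}{2} + c_4\dfrac{\lambda^3}{6}\right),$

(v) $a_2 + a_3\beta + a_4\dfrac{\beta^2}{2} + b_3\kappa + b_4\dfrac{\kappa^2}{2} = D + c_3\lambda + c_4\dfrac{\lambda^2}{2}. \qquad (10.3)$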

The remaining two equations were found by equating the 0th and 2nd moments from the
curves to the true values 1 and 4/11 respectively. The frequency functions found were as
follows:

$$F(x) = 0.6927 - 0.320142x^2 - 2.7751x^3 + 3.08x^4,$$
$$G(y) = 0.4442 - 0.854177y + 0.308677y^2 + 0.649680y^3 - 0.553667y^4, \qquad (10.4)$$
$$H(z) = 0.040192z^2 + 0.027763z^3 + 0.008933z^4,$$

where $x = t_8$.

For reasons which will be apparent in the next section, it was not deemed necessary to
apply higher momental checks in this case.
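Although no higher momental check was applied at this stage, the conditions that were imposed on the three zonal curves are easy to confirm numerically. The sketch below takes the coefficients of (10.4) with the signs as recovered from the scan (an assumption) and uses hypothetical helper names; it checks continuity at the two interior links, the total frequency and the variance 4/11.

```python
import math

BETA, GAMMA, DELTA = 2 / math.sqrt(15), 2 / math.sqrt(3), 6 / math.sqrt(7)
F = [0.6927, 0.0, -0.320142, -2.7751, 3.08]               # powers of x
G = [0.4442, -0.854177, 0.308677, 0.649680, -0.553667]    # powers of y = x - BETA
H = [0.0, 0.0, 0.040192, 0.027763, 0.008933]              # powers of z = DELTA - x


def poly(coeffs, v):
    """Evaluate sum of coeffs[k] * v^k."""
    return sum(a * v**k for k, a in enumerate(coeffs))


def f8(x):
    """Fitted frequency of t_8 in its three zones."""
    x = abs(x)
    if x <= BETA:
        return poly(F, x)
    if x <= GAMMA:
        return poly(G, x - BETA)
    if x <= DELTA:
        return poly(H, DELTA - x)
    return 0.0


def midpoint(g, a, b, n=20000):
    """Composite midpoint rule on (a, b)."""
    h = (b - a) / n
    return h * sum(g(a + (i + 0.5) * h) for i in range(n))


def even_moment(m):
    g = lambda x: x**m * f8(x)
    zones = [(0.0, BETA), (BETA, GAMMA), (GAMMA, DELTA)]
    return 2 * sum(midpoint(g, a, b) for a, b in zones)


total, mu2 = even_moment(0), even_moment(2)
print(f"F(beta) = {poly(F, BETA):.4f}, G(0) = {G[0]:.4f}")
print(f"G(kappa) = {poly(G, GAMMA - BETA):.4f}, H(lambda) = {poly(H, DELTA - GAMMA):.4f}")
print(f"total = {total:.5f}, mu2 = {mu2:.5f} (4/11 = {4 / 11:.5f})")
```

The link ordinates 0.4442 and 0.1018 are recovered from both sides, and the total frequency and second moment agree with 1 and 4/11 to about four decimal places.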

Reference may here be made to yet another experiment, the negative result of which
may have some interest. At the n = 6 stage the remarkable 'regularity' which the curve
assumed, after its highly bizarre appearance at the stage before, suggested that orders of
contact (except at the limit of range) might be ignored at a slightly later stage and a single
curve fitted using the moments only.

Using $\mu_0$, $\mu_2$, $\mu_4$ and $\mu_6$, and the form $D(\delta - x)^2/2$ (see (10.1)) at the limit of range,
with $F_1'(0) = 0$, the following frequency curve was found:

$$F_1(x) = 0.040192(\delta - x)^2 + 0.132866(\delta - x)^3 - 0.293716(\delta - x)^4 + 0.231146(\delta - x)^5 - 0.039279(\delta - x)^6 - 0.005209(\delta - x)^7. \qquad (10.5)$$

The correct values of the moments (to 6 places) were

$$\mu_4 = 0.414644, \qquad \mu_6 = 0.763334, \qquad \mu_8 = 1.823617. \qquad (10.6)$$

The value of the 8th moment computed from the curve was 1.993270, an error therefore
of +9.3%. The central ordinate $F_1(0) = 0.9017$ as compared with the actual 0.6927, so
that the curve $F_1(x)$ could not validly be used for further iteration, since the frequencies near
the central frequency would be considerably in error. The probability (computed from
(10.5)) for $F_1(x)$ beyond the 'true' 5% probability point (computed from (10.4)) is 0.0466,
which is quite accurate enough for practical purposes. This concordance, unexpected in view
of the other facts mentioned, is due principally to the fact that $F_1(x)$ has the correct form at
the limit of range. This experiment shows that, despite the regularity of the distribution
for n = 8, the problem of finding the nearly exact distribution cannot be treated in cavalier
fashion.
11. Probability points for frequencies for samples of 8 or more

By the Gram-Charlier theorem for symmetrical distributions under general conditions any
frequency f(w), where w has mean zero and variance unity, can be expanded in the form


f{w) = exp 


il 


+ 


where 


4! \dw 

e{w)=^ 


6!AiUte/ ^8 


^8 / 

!At\; 


dwj 




(IM). 


1 




exp — 


the λ being semi-invariants of the original variate. Let u be a normal variate with mean
zero and variance unity. Using the method of E. A. Cornish & R. A. Fisher (1937) their
expression for w in terms of u has been extended to the following effect:


w ^ u- 




24A2^® 384At^® 


A« 


*5 + 


At 


720A| ® 


3072AI 


2/7- 


A 4 A, 


4''•8 


A« 


1152Ar’' 


4b32A* 


a?7 4* 


(1L2) 


where the $x_k$ are Hermite polynomials in u of the degree indicated. The x, y and z terms in
(11.2) are as follows:

$$x_3 = -u^3 + 3u, \qquad\qquad\qquad y_7 = 9u^7 - 131u^5 + 451u^3 - 321u,$$
$$z_5 = 3u^5 - 24u^3 + 29u, \qquad\quad z_7 = u^7 - 17u^5 + 69u^3 - 57u, \qquad (11.3)$$
$$x_5 = -u^5 + 10u^3 - 15u, \qquad\; x_7 = -u^7 + 21u^5 - 105u^3 + 105u.$$

At (11.2) the expansion is taken to $O(n^{-3})$ because $\lambda_{2k}/\lambda_2^k$ is $O(n^{-k+1})$ when the λ are
semi-invariants of $\sqrt{b_1}$ for samples of n.

The x, y and z functions at various probability levels are as follows:

                              Probability points
Function    0.10         0.05         0.025        0.01          0.001

u           1.281552     1.644854     1.959964     2.326348      3.090223

x_3         1.739867     0.484338     -1.649229    -5.610905     -20.239364

z_5         -2.97984     -22.98240    -37.09066    -30.28992     +226.9286

x_5         -1.632248    7.789164     16.986942    22.868797     -33.068481

y_7         136.1309     194.9663     -22.4606     -675.7597     -1286.263

z_7         19.09291     41.19969     27.20947     -53.46639     -261.9424

x_7         -19.5234     -74.2936     -88.4883     -15.5752      362.6626

(11.4)
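The tabulated values can be compared with the polynomials themselves. The sketch below evaluates six polynomials in u at the 0.10 and 0.05 normal deviates; the polynomial forms (x_3 = -u³ + 3u, z_5 = 3u⁵ - 24u³ + 29u, x_5 = -u⁵ + 10u³ - 15u, y_7 = 9u⁷ - 131u⁵ + 451u³ - 321u, z_7 = u⁷ - 17u⁵ + 69u³ - 57u, x_7 = -u⁷ + 21u⁵ - 105u³ + 105u) are reconstructions from a damaged scan, checked only against the legible table entries, and should be treated as assumptions.

```python
# Polynomials in the normal deviate u (reconstructed forms, see the note above).
POLYNOMIALS = {
    "x3": lambda u: -u**3 + 3 * u,
    "z5": lambda u: 3 * u**5 - 24 * u**3 + 29 * u,
    "x5": lambda u: -u**5 + 10 * u**3 - 15 * u,
    "y7": lambda u: 9 * u**7 - 131 * u**5 + 451 * u**3 - 321 * u,
    "z7": lambda u: u**7 - 17 * u**5 + 69 * u**3 - 57 * u,
    "x7": lambda u: -u**7 + 21 * u**5 - 105 * u**3 + 105 * u,
}

# Upper-tail normal deviates for the 0.10 and 0.05 levels.
DEVIATES = {0.10: 1.281552, 0.05: 1.644854}

values = {
    level: {name: p(u) for name, p in POLYNOMIALS.items()}
    for level, u in DEVIATES.items()
}

for level, row in values.items():
    cells = ", ".join(f"{name} = {v:.4f}" for name, v in row.items())
    print(f"level {level}: {cells}")
```

At these two levels the computed values agree with the legible entries of the table to within a unit or two in the third decimal place, which supports the reconstructed forms.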


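For n = 8 the semi-invariants, etc., required are

$\lambda_2 = 0.363636$,

$\lambda_4 = 0.017950$, $\qquad \lambda_4/\lambda_2^2 = 0.1357$,

$\lambda_6 = -0.055836$, $\qquad \lambda_6/\lambda_2^3 = -1.1612$, $\qquad (11.5)$

$\lambda_8 = 0.046470$, $\qquad \lambda_8/\lambda_2^4 = 2.6577$.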


If the formula at (11.2) were quite correct and then if we computed, at any probability
level ε, the value of w, then set $x = \lambda_2^{1/2}w$ and from (10.4) computed the probability from end
of range, the result should be exactly ε, assuming, of course, that (10.4) gives the exact
frequency distribution. When this procedure is carried out at the different pseudo-probability
(i.e. the probability of x) levels indicated, the following results are found:

Pseudo-probability                                  0.10       0.05       0.025      0.01       0.001

(a) True probability (to $x = \lambda_2^{1/2}w$)    0.096866   0.050469   0.026825   0.011604   0.001166   (11.6)

(b) Normal probability (to $x = \lambda_2^{1/2}u$)  0.093564   0.052376   0.029502   0.013419   0.001090

The correspondence at (a) is obviously satisfactory. At first sight it might appear that at
0.01 and 0.001 levels the divergence is (by the standards of this communication) rather
marked. Actually this is not the case considering the fantastic difference in the algebraic form
of the Gram-Charlier and the actual frequencies near the limit of range. The probabilities
at (b) show that the normal curve gives quite a good representation. At n = 8, however, the
comparison flatters the normal curve since, as R. A. Fisher (1930) has shown, the ratio
$\mu_4/\mu_2^2$ actually assumes its normal value of 3 at n = 7 and reaches its greatest value at n = 22.
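Fisher's remark, and the first two semi-invariants used for n = 8, can be reproduced from the closed forms for the variance and the fourth-moment ratio of √b₁ in normal samples. These closed forms are not quoted in this section; the sketch assumes the standard expressions var = 6(n - 2)/((n + 1)(n + 3)) and μ₄/μ₂² = 3 + 36(n - 7)(n² + 2n - 5)/((n - 2)(n + 5)(n + 7)(n + 9)), and the function names are hypothetical.

```python
def variance_sqrt_b1(n):
    """Variance of sqrt(b1) for normal samples of n (standard closed form, assumed)."""
    return 6 * (n - 2) / ((n + 1) * (n + 3))


def ratio_mu4_mu2sq(n):
    """mu4 / mu2^2 of sqrt(b1) for normal samples of n (standard closed form, assumed)."""
    return 3 + 36 * (n - 7) * (n * n + 2 * n - 5) / ((n - 2) * (n + 5) * (n + 7) * (n + 9))


# Semi-invariants for n = 8: lambda2 = mu2, lambda4 = mu4 - 3 mu2^2.
lambda2 = variance_sqrt_b1(8)
lambda4 = (ratio_mu4_mu2sq(8) - 3) * lambda2**2

# Sample size at which the ratio is greatest.
n_at_max = max(range(4, 201), key=ratio_mu4_mu2sq)

print(f"lambda2 = {lambda2:.6f}, lambda4 = {lambda4:.6f}, "
      f"lambda4/lambda2^2 = {lambda4 / lambda2**2:.4f}")
print(f"ratio at n = 7: {ratio_mu4_mu2sq(7):.6f}")
print(f"ratio is greatest at n = {n_at_max}: {ratio_mu4_mu2sq(n_at_max):.6f}")
```

Under these assumptions the ratio equals 3 exactly at n = 7 and peaks at n = 22, while the n = 8 values of the second and fourth semi-invariants agree with those listed for that sample size.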

We now propose to take a step which is discussed in some detail in the final section. We
shall endow the right side of (11.2) with a remainder term which will make the probability
of w formally the same as the pseudo-probability at (11.6). The following table shows the
value of the variate $t_8/\sigma$ computed from (10.4) (where x represents $t_8$) at different true
probability levels, together with the corresponding value of w computed from (11.2):


Probability    t_8/σ        w            R

0.10           1.253173     1.273231     -82.2

0.05           1.671682     1.666548     21.0

0.025          2.043181     2.008014     143.3

0.01           2.446125     2.389594     227.5

0.001          3.107977     3.079696     115.8
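The second column of this table can be checked by integrating the n = 8 frequency curves from the tabulated point to the end of the range. The sketch below assumes the outer two zonal curves G(y) = 0.4442 - 0.854177y + 0.308677y² + 0.649680y³ - 0.553667y⁴ and H(z) = 0.040192z² + 0.027763z³ + 0.008933z⁴ (signs recovered from the scan), with σ² = 4/11; the helper names are hypothetical.

```python
import math

SIGMA = math.sqrt(4 / 11)
BETA, GAMMA, DELTA = 2 / math.sqrt(15), 2 / math.sqrt(3), 6 / math.sqrt(7)
G = [0.4442, -0.854177, 0.308677, 0.649680, -0.553667]   # powers of y = x - BETA
H = [0.0, 0.0, 0.040192, 0.027763, 0.008933]             # powers of z = DELTA - x


def integral(coeffs, v):
    """Integral from 0 to v of sum coeffs[k] * s^k ds."""
    return sum(a * v**(k + 1) / (k + 1) for k, a in enumerate(coeffs))


def upper_tail(x):
    """Probability beyond x for the fitted n = 8 curves (valid for x >= BETA)."""
    if x >= DELTA:
        return 0.0
    if x >= GAMMA:
        return integral(H, DELTA - x)
    if x >= BETA:
        outer = integral(H, DELTA - GAMMA)
        return integral(G, GAMMA - BETA) - integral(G, x - BETA) + outer
    raise ValueError("x lies inside the innermost zone; not handled in this sketch")


# Probability level -> tabulated value of t_8 / sigma.
POINTS = {0.10: 1.253173, 0.05: 1.671682, 0.025: 2.043181, 0.01: 2.446125, 0.001: 3.107977}

tails = {level: upper_tail(t * SIGMA) for level, t in POINTS.items()}
for level, p in tails.items():
    print(f"nominal {level}: tail probability from the curves = {p:.6f}")
```

Each tabulated point returns its nominal probability to within a few units in the fifth decimal place, so the entries in the second column appear to have been read correctly.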


It has been seen that the difference between w as given by (11.2) and the true value is $O(n^{-4})$.
Accordingly the values of R were found at the different probability levels by setting

$$\frac{R}{n^4} = \frac{t_n}{\sigma} - w, \qquad (11.8)$$

with n = 8. The estimates of the probability points P for values of n > 8 are accordingly


=4 






(11-9) 


the values of A, B,0 and R being given in the following table: 


Prob¬ 

ability 

A 

B 

0 

D 

E 

F 


R 

0*10 

0*06 ! 

0*026 1 

0*01 ' 

0*001 

1*281662 
1*644864! 
1*969964 1 
2*326348 ! 
3*090223 1 
! 

-0*0724946 
-0*0201808! 
0*06871791 
0*2337877! 
0*8433065l 

1 

0*00776. 

0*05985 

0*09659 

0*07888 

-0*59096 

0*00227 

-0*01082 

-0*02357 

-0*03176 

0*04591 

0-04431 

0-06346 

-0-00731 

-0-21997 

-0-41871 

-0*01657 
-0*03576 
-0*023621 
0*04640' 
0*22738 

0-000484 

0-001843 

0-002195 

0-000386 

-0-008995 

-82*2 

21*0 

143*3 

227*6 

116*8 


The terms in the first four columns agree with, or have been derived from, Cornish & Fisher
(1937). The λ's are semi-invariants derivable from the exact values of the moments given
at (1.2).

As a test, the following is a comparison of the 0.05 and 0.01 probability points for n = 25
as derived by E. S. Pearson (1930) (using a Type VII curve) with the values from (11.9):

Probability level    Pearson    Geary (11.9)

0.05                 0.711      0.707

0.01                 1.061      1.062



Fig. 3. Frequency of $\sqrt{b_1}$ for n = 3. (Abscissa: variate, s.d. = 1.)


With standard deviation σ = 0.435 it is obvious that the differences are not important.
Sample number 25 is the lowest for which Pearson computed the probability points, and
for two levels only. The formulae at (11.9) can probably be accepted with confidence.

12. Conclusion

From frequency formulae (5.2), (6.8), (7.25), (8.8), (9.10) (with (9.6) and (9.9)) and (10.4)
the probability points for $\sqrt{b_1}$ for normal random samples of n = 3, 4, 5, 6, 7 and 8,
respectively, can be determined without difficulty. The six frequency distributions are illustrated
in Figs. 3-8. On each of the frequency curves there is superimposed the normal frequency
with the same standard deviation, the intention being to enable a contrast to be made
between the several curves by reference each to the normal frequency, and to show the
fairly rapid approach of the frequency to normality with increasing n, even for small
samples.*

In this research nothing was so remarkable as the transformation which the single step
in the iteration, namely that from n = 5 to n = 6, effected in the shape of the frequency
curve. From n = 6 on, the join at the links is effected so smoothly as to be almost
imperceptible to the eye. The eye, however, flatters the actual approach to normality in the
frequency curves, as measured algebraically by the probability points.



Fig. 4. Frequency of $\sqrt{b_1}$ for n = 4. (Abscissa: variate, s.d. = 1.)


It may be well, at this stage, to recapitulate. Using integral iteration formula (2.13)
(or (2.14)), frequency ordinates were computed at values of the variate termed the 'links'
at which the frequency is shown to have functional discontinuities. Using the exact values
of the moments (given at (1.2)), and taking into account the known order of contact (§ 3) of
the different functions at the links and the known form assumed by the frequency at the
known limit of range, inter-link frequencies were determined in polynomial form. Attention
is directed to the use, at the n = 4 to 7 (inclusive) stages, of the higher moments for the
purpose of checking the general reliability of the frequency curve (or rather series of curves
joined at the links).

* As shown, the approach to normality is not, however, uniform with increasing n,
as indicated, say, by the moment ratio. See p. 90 above.

Of far greater practical importance, however, are the formulae (11·9) designed for the
estimation of the 0·10, 0·05, 0·025, 0·01 and 0·001 probability points for normal random
samples of [...] for [...]. There will be little trouble about finding the corresponding formulae
for other probability links. What degree of confidence can be reposed in these formulae?
This raises in an acute form the vexed question (on which the protagonists of different schools
were prone to get very vexed indeed a generation ago) of how best to use moments (or
semi-invariants) for estimating frequency distributions. The general problem was constantly



Fig. 5. Frequency of $\sqrt{b_1}$ for $n = 5$. (Abscissa: variate, s.d. = 1.)


in the writer's mind during the present research and he would be glad if his colleagues could
study the possibilities of the methods which culminated in formulae (11·9) for bridging
the chasm which still divides the knowledge (sometimes exact) of the lower moments of
statistics like $\sqrt{b_1}$ and $b_2$ and the formulae (however empirically established) for the
frequency, in which a measure of confidence can be reposed. This fundamental problem was
abandoned some years ago in a thoroughly unsatisfactory condition.

The Karl Pearson approach consists essentially in having regard to the 'shape' which
experience has shown that frequency curves tend to assume and to use the first four moments
for the purpose of determining the constants of the curve. The disadvantage of the Pearson
method is that of itself it gives no indication as to whether the resulting curve closely follows
the actual frequency: it is necessary to have recourse to such devices as comparing the curve
with a frequency distribution determined from hundreds of random sample computations
of the statistic under examination. Apart from the tediousness of this method it is often
indecisive in regard just to the parts of the frequency which are of most importance, namely
the ends, because the small numbers which the check computation throws into these zones
are usually subject to large (Poisson) errors.



The Gram-Charlier system, on the other hand, can only be used with confidence when the
frequency is fairly close to the normal. In practice the reliability is judged by the
convergence of such terms as one can compute from the moments, i.e. if the successive terms
show an 'unmistakable' tendency to diminish one feels confident in the computed frequency.

Obviously what the Pearson, the Gram-Charlier and other frequency systems all require
is a Remainder Theorem. Since, however, an infinite number of moments are required to
define a frequency distribution, with only a few moments known the most that can be
expected is that upper (or lower) limits of the probability of the statistic can be established as
functions of the known moments. This is what Tchebychev's Theorem, and theorems of the
type, do. Too much cannot be expected from the knowledge of a few moments: the
approximations are almost invariably too rough for statistical use, when a high standard of
efficiency is required; and M. Fréchet (1937) has shown that the Tchebychev type approximations
are the best, given the assumptions, which can be made. For all their great mathematical
importance (incidentally for their justification for the statistician of 'the faith that is in
him'), it seems to the writer that research on these lines will not produce formulae which
will be statistically utilizable in general conditions; but he may be quite wrong.



Knowing the earlier moments the Cornish-Fisher type expression (depending on the
Gram-Charlier form of frequency) gives, at any probability level, an expansion for the
variate to a defined order in the sample number. As might be surmised from the coefficients
of the normal moments (e.g. (1·2) above), the coefficients in powers of $n$ in the expansion
of the variate usually tend to increase rapidly. In the present paper a remainder term of
suitable order in $n$ has been added to the known terms in the former expansion and its
coefficient found by reference to the (assumed) exactly known expansion for $n = 8$. Clearly
two more terms (in the next two powers of $n$; the exponents are illegible in the source) could
have been found had we iterated the frequency to $n = 9$ and $n = 10$, respectively, though this
was not deemed necessary in the





ON THE COMPUTATION OF UNIVERSAL MOMENTS OF TESTS OF
STATISTICAL NORMALITY DERIVED FROM SAMPLES DRAWN
AT RANDOM FROM A NORMAL UNIVERSE. APPLICATION TO
THE CALCULATION OF THE SEVENTH MOMENT OF $b_2$

By R. C. GEARY and J. P. G. WORLLEDGE


1. Introductory

The principal object of this communication is to develop a computational technique
appropriate to the formula given by one of the authors (Geary, 1933). By way of illustration the
formula is applied to the computation of the seventh moment of

$$b_2 = \frac{m_4}{m_2^2} = \frac{n\sum_{i=1}^{n}(x_i-\bar{x})^4}{\bigl\{\sum_{i=1}^{n}(x_i-\bar{x})^2\bigr\}^2}, \tag{1·1}$$

where $x_1, x_2, \ldots, x_n$ are the measures of the random sample of $n$ and of which $\bar{x}$ is the
arithmetic mean. Universal normality is assumed throughout.

A glance at formula (3·9) in which this paper culminates will indicate that the task of
deriving higher normal moments of $b_2$ is not one to be undertaken in a frivolous spirit. The
work finds its main justification in the conviction of the authors that accurate (if not exact)
values of the probability points of $b_2$ can be found in terms of the moments of $b_2$ for all values
of $n$ using a method which has proved successful in the case of the analogous test of
asymmetry, involving

$$\sqrt{b_1} = \frac{m_3}{m_2^{3/2}}. \tag{1·2}$$

In turn, the importance of the determination of accurate probabilities for $\sqrt{b_1}$ and $b_2$ for
normal samples derives from the facts revealed by unpublished work by one of the authors.
This shows (1) that probabilistic inferences drawn from the well-known significance tests
based on the assumption of universal normality are apt to go astray when, in fact, the universe
is not normal, and (2) that $\sqrt{b_1}$ and $b_2$ provide the most efficient tests of asymmetry and
kurtosis, respectively, in indefinitely large samples, amongst wide fields of alternative tests
and of alternative non-normal universes.

R. A. Fisher (1930) has given the exact values of the second, fourth and sixth moments of
$\sqrt{b_1}$ and J. Pepper (1932) the eighth moment. In the former paper R. A. Fisher also gave the
values of the second and third moments of $b_2$. The moment field was extended by J. Wishart
(1930) and in a joint paper by R. A. Fisher & J. Wishart (1931). C. T. Hsu & D. N. Lawley
(1940) gave the fifth and sixth moments of $b_2$. All these authors used the combinatorial
method due to R. A. Fisher (1929). The present approach is entirely different.


2. The fundamental relation

To make the exposé complete it may be useful to reproduce the relevant part (which is quite
brief) of the 1933 paper. The method used is due essentially to C. C. Craig (1928), applied to
the normal case. Using, in the usual notation, a prefixed $E$ to indicate 'expected' or, more
accurately, 'average value for all samples', we have for the characteristic function of the

$$z_i = x_i - \bar{x} \quad (i = 1, 2, \ldots, n), \tag{2·1}$$

the expression

$$E \exp\Bigl(\sum_{i=1}^{n} t_i z_i\Bigr), \tag{2·2}$$

where the $t_i$ are $n$ parameters, so that $E(z_1^{a_1} z_2^{a_2} \ldots z_p^{a_p})$, where the $a_i$ are any $p$
positive integers, is the coefficient of $t_1^{a_1} t_2^{a_2} \ldots t_p^{a_p}$ in the expansion of

$$a_1!\,a_2! \ldots a_p!\; E \exp\{t_1(x_1-\bar{x}) + t_2(x_2-\bar{x}) + \ldots + t_n(x_n-\bar{x})\}.$$

The exponent can be written in the form

$$x_1(t_1-\bar{t}) + x_2(t_2-\bar{t}) + \ldots + x_n(t_n-\bar{t}), \tag{2·3}$$

where $\bar{t} = \sum_{i=1}^{n} t_i / n$. Since the $x_i$ are independent,

$$E \exp \sum_{i=1}^{n} x_i(t_i-\bar{t}) = \prod_{i=1}^{n} E \exp x_i(t_i-\bar{t}). \tag{2·4}$$

Assuming, as we may without loss of generality, that the normal universe of the $x_i$ has
mean zero and unit standard deviation, we have

$$E \exp x_i(t_i-\bar{t}) = \exp \tfrac{1}{2}(t_i-\bar{t})^2.$$

Hence

$$a_1!\,a_2! \ldots a_p!\; E \exp \sum_{i=1}^{n} x_i(t_i-\bar{t}) = a_1!\,a_2! \ldots a_p!\; \exp\Bigl\{\tfrac{1}{2}\sum_{i=1}^{n}(t_i-\bar{t})^2\Bigr\}. \tag{2·5}$$

By definition, the power $f$ of a term is given by $f = \sum_i a_i$, the dimension by $p$. It is clear
that the required universal mean value of

$$E(x_1-\bar{x})^{a_1}(x_2-\bar{x})^{a_2} \ldots (x_p-\bar{x})^{a_p}$$

will be found as the coefficient of $\prod t_i^{a_i}$ in the expansion of

$$\frac{a_1!\,a_2! \ldots a_p!}{2^k\,k!}\Bigl\{\sum t_i^2 - \frac{1}{n}\Bigl(\sum t_i\Bigr)^2\Bigr\}^k, \tag{2·6}$$

where $2k = f$.

3. The computational scheme 


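The computational scheme, which is quite general, will most clearly be outlined by reference
to the computation of the exact value of a specific moment (from origin) for the derivation
of which it was primarily designed. Then

$$
\begin{aligned}
E(m_4^7) &= \frac{1}{n^7}E(z_1^4 + z_2^4 + \ldots + z_n^4)^7 \\
&= \frac{1}{n^7}\Bigl[\,nE(28) + n(n-1)\Bigl\{\frac{7!}{6!\,1!}E(24\cdot 4) + \frac{7!}{5!\,2!}E(20\cdot 8) + \frac{7!}{4!\,3!}E(16\cdot 12)\Bigr\} \\
&\quad + n(n-1)(n-2)\Bigl\{\frac{7!}{5!\,1!^2\,2!}E(20\cdot 4^2) + \frac{7!}{4!\,2!\,1!}E(16\cdot 8\cdot 4) + \frac{7!}{3!^2\,1!\,2!}E(12^2\cdot 4) + \frac{7!}{3!\,2!^2\,2!}E(12\cdot 8^2)\Bigr\} \\
&\quad + n(n-1)(n-2)(n-3)\Bigl\{\frac{7!}{4!\,1!^3\,3!}E(16\cdot 4^3) + \frac{7!}{3!\,2!\,1!^2\,2!}E(12\cdot 8\cdot 4^2) + \frac{7!}{2!^3\,3!\,1!}E(8^3\cdot 4)\Bigr\} \\
&\quad + n(n-1)(n-2)(n-3)(n-4)\Bigl\{\frac{7!}{3!\,1!^4\,4!}E(12\cdot 4^4) + \frac{7!}{2!^2\,2!\,1!^3\,3!}E(8^2\cdot 4^3)\Bigr\} \\
&\quad + n(n-1)(n-2)(n-3)(n-4)(n-5)\,\frac{7!}{2!\,1!^5\,5!}E(8\cdot 4^5) \\
&\quad + n(n-1)(n-2)(n-3)(n-4)(n-5)(n-6)\,E(4^7)\Bigr],
\end{aligned}
\tag{3·1}
$$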


where, for example,

$$E(12\cdot 8\cdot 4^2) = E z_1^{12} z_2^{8} z_3^{4} z_4^{4} = E(x_1-\bar{x})^{12}(x_2-\bar{x})^{8}(x_3-\bar{x})^{4}(x_4-\bar{x})^{4}.$$

There are, accordingly, fifteen terms made up of one of dimension one, three of dimension
two, four of dimension three, etc. The structure of the numerical coefficients will be noted:
in particular that, when the power of a factorial appears in the denominator, its factorial
also appears. Each of the fifteen $E$ terms will be evaluated separately, grouped by dimensions
and multiplied by the $n$-factors.
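The count of fifteen terms and the structure of their numerical coefficients can be checked mechanically. The following is a minimal sketch, not the authors' procedure: it assumes each term of (3·1) corresponds to a partition of 7 (the exponents of the $z_i^4$), with coefficient $7!$ divided by the factorial of every part and by the factorial of every repeat count. The function names are illustrative only.

```python
from collections import Counter
from math import factorial


def partitions(total, largest=None):
    """Yield the partitions of `total` as non-increasing tuples."""
    if largest is None:
        largest = total
    if total == 0:
        yield ()
        return
    for first in range(min(total, largest), 0, -1):
        for rest in partitions(total - first, first):
            yield (first,) + rest


def coefficient(parts):
    """N!/(product of part factorials * product of repeat-count factorials)."""
    value = factorial(sum(parts))
    for part in parts:
        value //= factorial(part)
    for repeats in Counter(parts).values():
        value //= factorial(repeats)
    return value


# Partition (3, 2, 1, 1) stands for the term E(12.8.4^2), and so on.
terms = {p: coefficient(p) for p in partitions(7)}
by_dimension = Counter(len(p) for p in terms)

for parts, coeff in terms.items():
    powers = ".".join(str(4 * c) for c in parts)
    print(f"dimension {len(parts)}: {coeff:4d} x E({powers})")
```

The coefficients sum to 877, the number of ways of splitting seven factors into groups, which gives a quick guard against a mistyped factorial.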

As already stated, the value of $E(a_1 a_2 a_3 \ldots)$ will be found as the coefficient of
$\prod t_i^{a_i}$ in the expansion of

$$\frac{\prod a_i!}{2^{14}\,14!}\Bigl\{\sum t_i^2 + v\Bigl(\sum t_i\Bigr)^2\Bigr\}^{14} \tag{3·2}$$

with $v = -1/n$. In this case, of course, $f = 2k = 28$.

Expand (3·2) in powers of $v$ by the binomial theorem. Each of the $v$ power terms will, in
general, make a numerical contribution to the value of $E(a_1 a_2 \ldots)$ which will, accordingly,
be represented by a polynomial in $v$ of degree 14. The term in $v^s$ will be

$$\frac{a_1!\,a_2! \ldots}{14!\,2^{14}}\cdot\frac{14!}{s!\,(14-s)!}\cdot(14-s)!\,(2s)!\;T_s\,v^s, \qquad T_s = \sum\frac{1}{(a_1-2s_1)!\,s_1!\,(a_2-2s_2)!\,s_2! \ldots}. \tag{3·3}$$

In the $T_s$, summation extends to all non-negative integer series $s_1, s_2, \ldots$, so that
$\sum s_i = (14 - s)$, $s_1$ being associated with $a_1$, $s_2$ with $a_2$, etc. The values which the $s_i$ can
assume are obviously restricted further by the condition that

$$a_i \geq 2s_i.$$

Let the series $T_0, T_1, \ldots$ be termed the reciprocal factorial vector (hereafter usually written
'r.f.v.') of $a_1, a_2, \ldots$, the terms of the vector being regarded as of the order indicated by the
subscript. The vector will be indicated by clarendon type. From the computational point of
view the following relation is fundamental:

$$\mathbf{A} \times \mathbf{B} = \mathbf{AB}, \tag{3·4}$$

where $\mathbf{A} \equiv (a_1 a_2 \ldots)$ and $\mathbf{B} \equiv (b_1 b_2 \ldots)$ are any two r.f.v.'s. The multiplication sign at
(3·4) is defined as follows: the terms of $\mathbf{A}$ are multiplied respectively by $v^0$, $v^1$, $v^2$, etc.,
and added to give a scalar $A$; the terms of $\mathbf{B}$ in the reverse order are also multiplied
respectively by the appropriate powers of $v$ and summed to give $B$. The coefficients of
$v^0, v^1, \ldots$ in the product (in the ordinary sense) $AB$ give the vector $\mathbf{AB}$. Relation (3·4) is
immediately evident from the form of $T_s$ in (3·3). From this relation it is quite easy to build
up r.f.v.'s from those of lower order: 44 from 4, 84 from 8 and 4, 88444 from 8 and 8444, or
88 and 444, etc.
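The product rule lends itself to exact rational arithmetic. The sketch below is an illustration under stated assumptions rather than the authors' hand procedure: it takes the term of order $u$ in the r.f.v. of a single even power $a$ to be $1/\{(2u)!\,(a/2-u)!\}$ (that is, $s_1 = a/2 - u$ in $T_s$), forms products by ordinary polynomial multiplication, and then applies the multipliers $(2s)!/s!$ and $\prod a_i!/2^k$ to obtain the moment as a polynomial in $v$. The names `rfv_single`, `rfv_product`, `rfv` and `moment_poly` are hypothetical.

```python
from fractions import Fraction
from math import factorial


def rfv_single(a):
    """r.f.v. of one even power a; entry u is 1/((2u)! (a/2 - u)!)."""
    k = a // 2
    return [Fraction(1, factorial(2 * u) * factorial(k - u)) for u in range(k + 1)]


def rfv_product(A, B):
    """The product A x B: coefficients of the ordinary polynomial product."""
    out = [Fraction(0)] * (len(A) + len(B) - 1)
    for i, x in enumerate(A):
        for j, y in enumerate(B):
            out[i + j] += x * y
    return out


def rfv(powers):
    """r.f.v. of a term such as 88444, given as the list [8, 8, 4, 4, 4]."""
    vec = [Fraction(1)]
    for a in powers:
        vec = rfv_product(vec, rfv_single(a))
    return vec


def moment_poly(powers):
    """Coefficients, in ascending powers of v, of E(prod z_i^a_i) with v = -1/n."""
    k = sum(powers) // 2
    const = Fraction(1, 2 ** k)
    for a in powers:
        const *= factorial(a)
    return [const * Fraction(factorial(2 * s), factorial(s)) * t
            for s, t in enumerate(rfv(powers))]


print(rfv([4]))            # orders 0, 1, 2 of the r.f.v. of 4
print(moment_poly([4]))    # E(z^4) as a polynomial in v
print(rfv([8, 8, 4, 4, 4])[5])
```

As a sanity check, `moment_poly([4])` gives $3 + 6v + 3v^2 = 3(1+v)^2$, which agrees with the fourth moment of a normal variate of variance $1 - 1/n$.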
Having found all fifteen r.f.v.'s the second step in the computational process is to form the
scalar product of each r.f.v. and $(2s)!\,v^s/s!$ (the latter will be termed the $v$-multipliers
which, it is important to note, are the same for all the terms in (3·1)). From (3·3), this gives
$E(a_1 a_2 \ldots)$ divided by

$$\prod a_i!\,/\,2^{14}.$$

The latter are multiplied by the numerical factors in (3·1) to give what are termed the
constant multipliers, 'constant' in the sense that they are the same for all the $v$ power terms
in each of the $E$'s in (3·1), but these constant terms are different for the different $E$'s. For
example, the constant multiplier for the term $E(12\cdot 8\cdot 4^2)$ is

$$\frac{12!\,8!\,4!^2\,7!}{3!\,2!\,1!^2\,2!\,2^{14}}. \tag{3·5}$$

Note the 'absolute constants' $7!$ and $2^{14}$, and that the powers of the term appear as factorials
in the numerator and factorials one-fourth of these powers in the denominator. In the
denominator is also a $2!$ which is the factorial of the factorial power.

The third step in the computation is to sum the terms of the same dimensions. The final
step consists in the multiplication of the terms of the different dimensions by the $v$-factors
as follows:

Table 1. v-factors

Dimension    v-factor
    1        $v^6$
    2        $-(v^6 + v^5)$
    3        $2v^6 + 3v^5 + v^4$
    4        $-(6v^6 + 11v^5 + 6v^4 + v^3)$
    5        $24v^6 + 50v^5 + 35v^4 + 10v^3 + v^2$
    6        $-(120v^6 + 274v^5 + 225v^4 + 85v^3 + 15v^2 + v)$
    7        $720v^6 + 1764v^5 + 1624v^4 + 735v^3 + 175v^2 + 21v + 1$
                                                                              (3·6)

The $v$-factors at (3·6) are, of course, the $n$-factors in (3·1) (together with the divisor $n^7$)
with $v = -1/n$.
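The entries of Table 1 can be regenerated from the relation just stated. A minimal sketch, assuming the $v$-factor of dimension $p$ is $n(n-1)\ldots(n-p+1)/n^7$ rewritten with $n = -1/v$, that is $(-v)^{7-p}(1+v)(1+2v)\ldots(1+(p-1)v)$; the function name `v_factor` is hypothetical.

```python
def v_factor(p, m=7):
    """Coefficients, in ascending powers of v, of n(n-1)...(n-p+1)/n^m with n = -1/v."""
    poly = [1]
    for j in range(1, p):
        # Multiply the running polynomial by (1 + j*v).
        nxt = [0] * (len(poly) + 1)
        for i, c in enumerate(poly):
            nxt[i] += c
            nxt[i + 1] += j * c
        poly = nxt
    sign = -1 if (m - p) % 2 else 1
    # The factor (-v)^(m-p) shifts the coefficients and fixes the sign.
    return [0] * (m - p) + [sign * c for c in poly]


for p in range(1, 8):
    print(p, v_factor(p))
```

Reading each printed list from right to left gives the corresponding row of Table 1 in descending powers of $v$.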

To deal with the very large whole numbers and their reciprocals which arise in factorial
computation we had recourse to a prime number index notation. For this purpose the number
is factorized into powers of the lower primes (we have used the notation for primes not
exceeding 31). Thus

$$746{,}137{,}199{,}808{,}000 = 6847 \cdot 13^1 \cdot 11^1 \cdot 7^2 \cdot 5^3 \cdot 3^5 \cdot 2^9$$

is written in this notation 6847[112359],

the digits in the square brackets [ ] being the powers of the lowest primes arranged in ascending
order from the right. The ordinary number 6847 will be known as the coefficient and the
symbolical number in square brackets as the primal of the original number. Note that in
this example the notation effects an economy from 15 to 10 in the number of digits
required to describe the number. Should the original number not be factorizable by a
particular small prime a 0 will be inserted in the proper place, e.g. [10358] means that 7 is
not a factor of the number represented. If, as often happens with the first two primes, the
indices exceed 9, decimal points are used, e.g. [124·11·17] means that the original number
has $2^{17}$ and $3^{11}$ as factors. The primal notation can be used when the indices are all positive
or all negative: occasionally, however, + and - signs have to be mixed in the primal (see
Table 6).

With little practice great facility is acquired in applying the ordinary rules to numbers
in primal notation. For multiplication or division corresponding digits in the primals are
added or subtracted, the coefficients being dealt with in the ordinary way. In addition or
subtraction common factors in the primals are immediately evident and the coefficient of
the sum (or difference) is derived usually by a single product-sum (or product-difference)
operation on a multiplying machine. It may be observed that all the work for this paper
was executed without inconvenience on small hand multiplying machines with capacity
9 × 8 × 13.
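The primal notation is easy to mimic in code. The sketch below is an illustration only: it covers positive indices, uses the primes up to 31, and assumes a raised dot is written between two adjacent indices whenever either exceeds 9. The function names are hypothetical.

```python
from math import factorial

PRIMES = (2, 3, 5, 7, 11, 13, 17, 19, 23, 29, 31)


def to_primal(number):
    """Split a positive integer into (coefficient, indices of 2, 3, 5, ...)."""
    indices = []
    for p in PRIMES:
        e = 0
        while number % p == 0:
            number //= p
            e += 1
        indices.append(e)
    while indices and indices[-1] == 0:
        indices.pop()          # drop unused higher primes
    return number, indices


def show(coefficient, indices):
    """Render as coefficient[primal], the index of 2 being the rightmost digit."""
    digits = [str(e) for e in reversed(indices)] or ["0"]
    text = digits[0]
    for prev, cur in zip(digits, digits[1:]):
        text += ("·" if len(prev) > 1 or len(cur) > 1 else "") + cur
    prefix = "" if coefficient == 1 else str(coefficient)
    return f"{prefix}[{text}]"


def multiply(a, b):
    """Multiply two numbers in primal form: add indices, multiply coefficients."""
    (ca, ea), (cb, eb) = a, b
    width = max(len(ea), len(eb))
    ea = ea + [0] * (width - len(ea))
    eb = eb + [0] * (width - len(eb))
    return ca * cb, [x + y for x, y in zip(ea, eb)]


print(show(*to_primal(746137199808000)))
print(show(*to_primal(factorial(12))))
print(show(*multiply(to_primal(factorial(4)), to_primal(factorial(3)))))
```

Division works the same way with the indices subtracted, which is why mixed signs can occasionally appear in a primal.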

In the following tables the first thirty-two factorials, the $v$-multipliers and the constant
multipliers required for the computation of $E(m_4^7)$ are expressed in primal notation.


Table 2. Factorials in primal notation

 0! = 1! = [0]                     17! = [111236·15]
 2! = [1]                          18! = [111238·16]
 3! = [11]                         19! = [1111238·16]
 4! = [13]                         20! = [1111248·18]
 5! = [113]                        21! = [1111349·18]
 6! = [124]                        22! = [1112349·19]
 7! = [1124]                       23! = [11112349·19]
 8! = [1127]                       24! = [1111234·10·22]
 9! = [1147]                       25! = [1111236·10·22]
10! = [1248]                       26! = [1112236·10·23]
11! = [11248]                      27! = [1112236·13·23]
12! = [1125·10]                    28! = [1112246·13·25]
13! = [11125·10]                   29! = [11112246·13·25]
14! = [11225·11]                   30! = [11112247·14·26]
15! = [11236·11]                   31! = [111112247·14·26]
16! = [11236·15]                   32! = [111112247·14·31]


Table 3. v-Multipliers in factorial and primal notation

Term in      Coefficient
$v^0$         0!/0!  = [0]
$v^1$         2!/1!  = [1]
$v^2$         4!/2!  = [12]
$v^3$         6!/3!  = [113]
$v^4$         8!/4!  = [1114]
$v^5$        10!/5!  = [1135]
$v^6$        12!/6!  = [11136]
$v^7$        14!/7!  = [111137]
$v^8$        16!/8!  = [111248]
$v^9$        18!/9!  = [1111249]
$v^{10}$     20!/10! = [1111124·10]
$v^{11}$     22!/11! = [1111225·11]
$v^{12}$     24!/12! = [11111225·12]
$v^{13}$     26!/13! = [11111245·13]
$v^{14}$     28!/14! = [11111248·14]
Table 4. Constant multipliers in factorial and primal notation

Required for computation of the
undermentioned term in (3·1)      Constant multiplier

$E(28)$                 $28!\,7!/(7!\,2^{14})$                              = [1112246·13·11]
$E(24\cdot 4)$          $24!\,4!\,7!/(6!\,1!\,2^{14})$                      = [1111244·11·11]
$E(20\cdot 8)$          $20!\,8!\,7!/(5!\,2!\,2^{14})$                      = [111145·11·11]
$E(16\cdot 12)$         $16!\,12!\,7!/(4!\,3!\,2^{14})$                     = [1246·11·11]
$E(20\cdot 4^2)$        $20!\,4!^2\,7!/(5!\,1!^2\,2!\,2^{14})$              = [111134·11·10]
$E(16\cdot 8\cdot 4)$   $16!\,8!\,4!\,7!/(4!\,2!\,1!\,2^{14})$              = [1145·10·11]
$E(12^2\cdot 4)$        $12!^2\,4!\,7!/(3!^2\,2!\,1!\,2^{14})$              = [235·11·10]
$E(12\cdot 8^2)$        $12!\,8!^2\,7!/(3!\,2!^2\,2!\,2^{14})$              = [145·10·10]
$E(16\cdot 4^3)$        $16!\,4!^3\,7!/(4!\,1!^3\,3!\,2^{14})$              = [1134·9·10]
$E(12\cdot 8\cdot 4^2)$ $12!\,8!\,4!^2\,7!/(3!\,2!\,1!^2\,2!\,2^{14})$      = [134·10·10]
$E(8^3\cdot 4)$         $8!^3\,4!\,7!/(2!^3\,3!\,1!\,2^{14})$               = [44·8·10]
$E(12\cdot 4^4)$        $12!\,4!^4\,7!/(3!\,1!^4\,4!\,2^{14})$              = [12398]
$E(8^2\cdot 4^3)$       $8!^2\,4!^3\,7!/(2!^2\,2!\,1!^3\,3!\,2^{14})$       = [3389]
$E(8\cdot 4^5)$         $8!\,4!^5\,7!/(2!\,1!^5\,5!\,2^{14})$               = [2188]
$E(4^7)$                $4!^7\,7!/(1!^7\,7!\,2^{14})$                       = [77]


The theory will be illustrated by reference to the computation of
$E(z_1^8 z_2^8 z_3^4 z_4^4 z_5^4) = E(8^2 4^3)$.
First the r.f.v. 88444 is found as the product 884 × 44 by setting down in equal spaces the
terms of 884 and on a movable slip spaced to the former the terms of 44 in reverse:

[Diagram: the terms of the r.f.v. 884 set down in a row, with the terms of 44 in reverse order
on a movable slip beneath them. All primals are negative.]

The term in 88444 from the position illustrated is that of the 5th order, namely,

5[4·13] + 109[4·12] + 777[14·10] + 803[112·11] + 1493[114·12]
    = [114·13] (5×7×5 + 109×7×5×2 + 777×7×8 + 803×9×4 + 1493×2)
    = [114·13] (3 × 27737) = 27737[113·13].

The manner of computation is indicated: first the largest (negative) digits in each of the four
positions of the primals are underlined and the underlined set is regarded as the common
factor. Note how, at the final stage, the factor 3 of the coefficient reduces the primal digit
from 4 to 3. From the entries in the round brackets ( ) it will be clear that, as stated above,
the procedure is well adapted to the multiplying machine. The full calculation of 88444 is
shown in Table 5.

The identity of the r.f.v.'s from the two factorizations of 88444 constitutes an absolute
check of the work. The calculation of $E(8^2 4^3)$ required for (3·1) is completed in Table 6.
In practice the figures in columns (4) and (5) of this table were derived from those in column
(3), and in Tables 3 and 4 by entering the latter on two movable slips and folding opposite
each entry, as required. This stage of the work was rapidly executed. The sum-product of
columns (1), (2) and (5) gives the value of $E(8^2 4^3)$. All the r.f.v.'s required for the calculation
of the $E$'s for (3·1) are given in the appendix.
Table 5. Calculation of reciprocal factorial vector 88444
All primals are negative 

Order 

(i) By 884 X 44 

0 : [29] 

1 : [2-8] + 6[29] 

2 : 7[3-10] + 6[28] + 109[3-ll] 

3 : [3'10] + 36[3-10]+109[3'101 + lll[1391 

4 : [4'13] + 6[3-10]4-763[4-12] + lll[138] + 803[112-12] 

5 : 6[4'13] + 109[4'12]-?-777[14-10] + 803[112-ll] + 1493[114-12] 

6 : 109[6-16] + 111[14‘10] + 6621[113-13]+1493[114-11] + 389[122-14] 

7 : 111[15-13] + 803[113-13]+1493[16-13]+389C122-13]+119[124-13] 

8 : 803[114-16] + 1493[118'13] + 389[23'15]+119[124-12] + 1643[226'17] 

9 : 1493[116'16] + 389[123-16]4-119[26-143+1643[226'16]+31[226-17] 

10 : ■389[124‘18] + 119[125-14] + 1643[126-18] + 31[226-163 4-[226-19] 

11 : 119[126-17]+1643[226-18] + 217[226'18] + [22S-18] 

12 ; 1543[227-21] + 31[226-18] + [120-20] 

13 : 31[227-21] + [226-20] 

14 : [227-23] 


r.f.v. of 88444 

[29] 

^[ao] 

9[-ll] 
947[13-10] 
1811[110'13] 
27737[113-13] 
1783141[12B-15] 
= 20627[116'13] 

: 1772417[225'17] 
: 647889[226‘17] 
= 151331[22fi-19] 
= 127[223-18] 

= 2329[227-21] 

= , 37[227-21] 

= [227-23] 


(ii) By 8844 x 4 

0 : [29] 

1 ; [29] + [18]- 

2 : [3-ll] + [18] + 86[3-10] 

3 ! [2-10]+86[3-10]+169[12-10] 

4 : 86[4-12]-fl69[l2-10] + llll3[104-13] 

6 : 169[13-12]4-11113[104-13] + 6137[114-11] 

6 ; 11113[105-16] + 6137[114-ll]-f22703[124-13] 

7 : 5137[115-13] + 22703[124-13] + 9341[126-13] 

8 : 22703[126-15] + 9341[126-13]-f 90641[226-17] 

9 : 9341[126-15] + 90541[226-17]+2463[226-16] 

10 : 90641[226-19] + 2463[226-ia]-H37[126-18] 

11 : 2463[226-18] + 137[120-18]+17[226-18] 

12 : 137[127-30]-f 17[226-18] + [226-21] 

13 ; 17[227-20] + [226-21] 

14 : [227-23] 


= [29] 

= 7[29] 

= 9[-ll] 

= 947[13-10] 

= 1811[110-13] 

= 27737[113-13] 

= 1783141[125-i6] 
= 20627[11B-13] 

= 1772417[226-17] 
= 547889[226-17] 
= 161331[226-19] 
= 127[223-18] 

= 2329[227-21] 

= 37[227-2li 

= [227-28] 


Finally, the $E$'s are multiplied by the appropriate $v$-factors given in Table 1, to give the
value of $E(m_4^7)$. Now R. A. Fisher (1930) (see also Geary, 1933) has shown that

$$E(b_2^7) = E(m_4^7/m_2^{14}) = E(m_4^7)/E(m_2^{14}), \tag{3·7}$$

$$E(m_2^{14}) = (n-1)(n+1)(n+3) \ldots (n+23)(n+25)/n^{14}. \tag{3·8}$$

Finally, + 211-3^2 +64,80213,164,290-3 

+ 608,584,331 • -i- 26,489,306,481 - 3V + 74,020,784,452- 7 • 3V 

-72,634,861,124 ■ 7 • 5 • + 407,081,273,666 • 7 • 5 • 3«n® 

-1.287,510,783,723- 7- 6- 38m*+ 2,526,463,322,982 • 7 - 5 - 3 V 

- 280,521,238,122 • 11 ■ 7 - 5 - 3 V + 3,036,544,767 -13 -11 - 7 • 5* • 3% 

- 135,393,525-13-11 ■ 72523«)/(ri. +1) (%+ 3)... (w + 23) (w + 26). (3-9) 
4. Corroboration of formulae

An integral part of the present work is the technique of check. To be of value the formulae at
(3·9) and (3·10) must be absolutely correct because (1) any errors made in factorial work are
fairly certain to be large and (2) the formulae are designed for use when $n$ is small, when
relatively small errors in the numerical coefficients may materially affect the results.
Furthermore, it is almost impossible to avoid error (even in a joint work like the present) with so
many individual calculations involving numbers astronomically large. As will appear, there
is a satisfactory, though not absolute, check at the final stage; but if it reveals error it does
not show where the error occurred, so that, if this were the sole check, there would be no

Table 6. Calculation of $E(8^2 4^3)$ from 88444


Term 

(1) 

r.f.v. 

88444 

(3) X r-multiplier 
(Table 3) 

(4) 

(4) X constant 
multiplier [3389] 

(6) 

Coefficient 

(2) 

Primal (nog.) 

(3) 

vO 

1 

[29] 

[-2-9] 

[3360] 


7 

[29] 

[-2-8] 

[3361] 


9 

[•11] 

[1-9] 

[3390] 

y3 

947 

[13*10] 

[-2-7] , 

[3362] 


.1811 1 

[110-13] 

[1-9] 1 

[3390] 


27737 

[113-13] 

[-8] 1 

[3381] 

V6 

1783141 

[125*15] 

[10-1-2-9] 

[13260] 


20627 ! 

[115-13] 

[1100-2-6] 

[113363] 


1772417 1 

[225-17] 

[11-10-1-9] 

[112370] 


547889 1 

[226-17] 

[111-10-2-8] 

[1112361] 

};10 

151331 

[226-19] 

[llll-iO-2-9] 

[11112360] 


127 

[223-18] 

[1111002-7] 

[111133'10'2] 


2329 

[227-21] 

[1111100-2-9] 

[111113360] 

j;13 

37 

[227-21] 

[1111102-2-8] 

[111113661] 

j;14 

I 

[227-23] 

[X11H021-9] 

[111113690] 


alternative but to face the tedium of complete recalculation. It is essential to devise an
absolute check at each stage. This has been done for the present technique.

The first check is the $n = 1$ (or $v = -1$) check. This is applicable to the $E$'s (see (3·1)) of
dimension one, two and three. It derives from the fact that

$$\sum t_i^2 + v\Bigl(\sum t_i\Bigr)^2 = (1+v)\sum t_i^2 + 2v\sum_{i<j} t_i t_j. \tag{4·1}$$

It will be immediately evident from the latter that when $v = -1$ the following terms vanish
identically:

(i) all terms of one dimension;

(ii) all terms of two dimensions except those of the type $E(14\cdot 14)$, with which we are not
concerned;

(iii) all terms of three or four dimensions in which the highest power exceeds 14: the latter
being the highest power which, say, $t_1$ can assume in the expansion of $\bigl(\sum_{i<j} t_i t_j\bigr)^{14}$.
Even in terms of three dimensions in which the highest power is less than 14, e.g. in
$E(12^2\cdot 4)$ and $E(12\cdot 8^2)$, the $v = -1$ test can be exploited. In fact, from (2·6) and (4·1)
the required terms for $v = -1$ are

$$E(12^2\cdot 4):\quad \frac{14!}{10!\,2!^2}\cdot\frac{12!^2\,4!\,7!}{3!^2\,2!\,14!\,2^{14}}\cdot 2^{14} = 12!\,[11124],$$

$$E(12\cdot 8^2):\quad \frac{14!}{6!^2\,2!}\cdot\frac{12!\,8!^2\,7!}{3!\,2!^2\,2!\,14!\,2^{14}}\cdot 2^{14} = 12!\,[3115].$$

The sum of these two terms is 131[1236·14] which should be the sum of the four $E$ terms of
dimension three in (3·1) since $E(20\cdot 4^2)$ and $E(16\cdot 8\cdot 4)$ are zero (for $v = -1$). The checks
specified in this paragraph were fully applied to the terms of one, two and three dimensions
in (3·1) before multiplication by the $v$-factors (Table 1).
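The arithmetic of this check is short enough to reproduce directly. A minimal sketch, assuming that each of the two terms is the product of a multinomial coefficient from $(\sum_{i<j} t_i t_j)^{14}$, the constant multiplier of Table 4 divided by $14!$, and $2^{14}$; the variable names are illustrative.

```python
from fractions import Fraction
from math import factorial as f

# E(12^2.4): t1^12 t2^12 t3^4 arises from (t1 t2)^10 (t1 t3)^2 (t2 t3)^2.
term_12_12_4 = (Fraction(f(14), f(10) * f(2) ** 2)
                * Fraction(f(12) ** 2 * f(4) * f(7), f(3) ** 2 * f(2) * f(14) * 2 ** 14)
                * 2 ** 14)

# E(12.8^2): t1^12 t2^8 t3^8 arises from (t1 t2)^6 (t1 t3)^6 (t2 t3)^2.
term_12_8_8 = (Fraction(f(14), f(6) ** 2 * f(2))
               * Fraction(f(12) * f(8) ** 2 * f(7), f(3) * f(2) ** 2 * f(2) * f(14) * 2 ** 14)
               * 2 ** 14)

total = term_12_12_4 + term_12_8_8

print(term_12_12_4 / f(12))   # multiple of 12! in the first term
print(term_12_8_8 / f(12))    # multiple of 12! in the second term
print(total)
```

The two multiples of $12!$ factorize as $2^4 \cdot 3^2 \cdot 5 \cdot 7 \cdot 11$ and $2^5 \cdot 3 \cdot 5 \cdot 7^3$, i.e. the primals [11124] and [3115], and the total factorizes as $131 \cdot 11 \cdot 7^2 \cdot 5^3 \cdot 3^6 \cdot 2^{14}$, i.e. 131[1236·14].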

Reciprocal factorial vectors for dimensions exceeding two were checked fully by the
'double' factorization technique exemplified in Table 5. In view of the simplicity of the
two subsequent processes, namely those of the $v$- and constant multipliers, this check may
be taken as establishing the accuracy of the $E$'s of dimension three or more. Reference may
nevertheless be made to a check at this stage, namely that the ratios of consecutive
coefficients in each $E$ exhibit a marked regularity, if correct. Any irregularity (which in the
nature of the work will usually be large) must be suspect.

Assuming the accuracy of the $E$'s in (3·1) the final stage was checked by multiplying by
the $v$-factors (3·6) in two ways:

(i) by straight multiplication using the primal notation;

(ii) by taking (in (3·1))

$$
\begin{aligned}
E(m_4^7) &= (1+v)(1+2v) \ldots (1+6v)A_1 + v(1+v) \ldots (1+5v)A_2 + \ldots \\
&= (1+v) \ldots (1+5v)\{(1+6v)A_1 + vA_2\} + \ldots,
\end{aligned}
$$

and computing in successive stages

$$B_2 = (1+6v)A_1 + vA_2, \qquad B_3 = (1+5v)B_2 + v^2A_3, \quad \text{etc.}$$

The results were the same.

A satisfactory check for the final stage is that of $n = 4$. A. T. McKay (1933) has, in fact,
given a formula for this value of $n$ from which the seventh moment from zero of $b_2$ is found
to be

82,220,810,261/6211 • 13-17- 23- 29, 

which value also transpired on substituting 4 for $n$ in (3·9). This establishes the accuracy
of all the formula except possibly the part accruing from the terms in (3·1) in

$$n(n-1)\ldots(n-4), \quad n(n-1)\ldots(n-5) \quad \text{and} \quad n(n-1)\ldots(n-6),$$

which vanish when $n = 4$.

If it adds nothing to the check in the previous paragraph it is nevertheless of interest to
observe that, for $n = 3$, the value of the seventh moment of $b_2$ is found to be $(\tfrac{3}{2})^7$ which
is as it should be since, in this case, each $b_2$ assumes the constant value $\tfrac{3}{2}$, whether the
samples are normal or not.
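The constancy of $b_2$ for samples of three is easily confirmed numerically. The sketch below is illustrative only: it uses the usual definition $b_2 = m_4/m_2^2$ and draws random samples of three from a normal and from an exponential universe; the generator seed and sample counts are arbitrary choices.

```python
import random


def b2(sample):
    """Sample kurtosis statistic b2 = m4 / m2^2, moments taken about the sample mean."""
    n = len(sample)
    mean = sum(sample) / n
    m2 = sum((x - mean) ** 2 for x in sample) / n
    m4 = sum((x - mean) ** 4 for x in sample) / n
    return m4 / m2 ** 2


rng = random.Random(0)
normal_values = [b2([rng.gauss(0.0, 1.0) for _ in range(3)]) for _ in range(1000)]
skewed_values = [b2([rng.expovariate(1.0) for _ in range(3)]) for _ in range(1000)]

print(min(normal_values), max(normal_values))
print(min(skewed_values), max(skewed_values))
```

Every value agrees with 3/2 up to rounding error, so the seventh moment is $(3/2)^7$ whatever the universe sampled.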

A partial check is also afforded at the final stage by the vanishing of all coefficients of
powers of $v$ from [...] to [...] inclusive.


5. Conclusion

Previous investigators in this field have all used the combinatorial technique, invented by
R. A. Fisher (1929) and applied in the first instance to the cumulants, which are linear
functions of the sample moments. The present writers have not had sufficient experience in
working the Fisher technique to decide which method is easier to apply. It is quite likely
that the Fisher method is shorter. A strong point of the present computational scheme is
that it lends itself to check at every stage; and the method may appeal to students who
prefer the algebraical or arithmetical to the geometrical approach. For their benefit, and
also in case it may later be found necessary (in connexion with the accurate determination
of the probability points of $b_2$ for samples of all sizes) to compute higher moments than the
seventh (it is almost certain the seventh will be required), we give as an appendix an
extended series of reciprocal factorial vectors. From these can be derived without difficulty
(i) corresponding $E$'s, e.g. [...], on multiplication by appropriate $v$- and constant multipliers
and (ii) r.f.v.'s of higher powers.


APPENDIX

A selection of reciprocal factorial vectors required for the calculation of moments of $b_2$ for
normal samples, including all used for the calculation of the seventh moment, in primal notation

All primals are negative


Order 

4 

8 

4» 

12 

84 

0 

[ 1 ] 

[m 

[ 2 ] 

[124] 

[14] 

1 

[ 1 ] 

m 

[ 1 ] 

[ 11 ^] 

[4] 

2 

[13] 

[14] 

7[13] 

[26] 

31[26] 

3 - 


[124] 

[13] 

[136] 

7[116] 

4 


[1127] 

[26] 

[1128] 

127[3128] 

6 




[1148] 

17[1138] 

6 




[1126*10] 

[113*10] 


43 

16 

12*4 


84^ 

0 

[3] 

[1127] 

[125] 

[26] 

[15] 

1 

3[3] 

[1125] 

[123] 

r24] 

[13] 

2 

13[5] 

[137] 

13[136] 

6[26] 

17[26] 

3 

3[4] 

[237] 

[36] 

31[136] 

63[126] 

4 

13[17] 

[113*10] 

47[H38] 

323[1139] 

2497[1138] 

6 

[17] 

[1259] 

29[1247] 

[1028] 

173[1137] 

6 

[39] 

[1126-11] 

167[11269] 

43[124*10] 

7[139] 

7 

[11225*11] 

[11249] 

[124*10] 

[1049] 

8 


[11236*16] 

[1126-13] 

[224*14] 

[114*13] 
Order

41 

20 

16-4 

0 

[4] 

[1248] 

[1128] 

1 

[ 2 ] 

[1148] 

[1028] 

2 

19[14] 

[113-10] 

11[13-101 

3 

6[4] 

[124-8] 

47[1238] 

4 

49[17] 

[124-11] 

263[124-11] 

5 

6[16] 

[136-11] 

89[126-11] 

() 

19[38] 

[1126-13] 

61[1116-13] 

7 

[38] 

[11226-12] 

1277[11226-12] 

8 

9 

[4-12] 

[11236-16] 
[111238-18] 
[1111248-18] 

229[H234-16] 

[11136-16] 

[U237-181 



8*4 

84* 

4* 

0 

[27] 

[10] 

[6J 

1 

6[27] 

6[ie] 

6[6] 

2 

109[39] 

13[8] 

125[n] 

3 

111[137] 

143[126] 

36[16] 

4 

803[112U0] 

1399[1119] 

646[28] 

5 

1493[114-10] 

239[1029] 

23[8] 

6 

389[122-12] 

6943[114-11] 

646[3-10] 

7 

119[124-11] 

65[104-10] 

36[39] 

8 

1643[226-16] 

277[114-14] 

126[4-13] 

9 

3I[226-16] 

23[U6-14] 

6[4-13] 

10 

U 

12 

[226-17] 

[116-16] 

[5-16] 


12-8 


[137] 
[37] 
31[139] 
7[227] 
299[m-10] 
73[116-10] 
3713[U26a2] 
83[1126-n] 
1181[1236-16] 
47[1237-16] 
[1237 • 17] 


24 

[ 1126 - 10 ] 
[11249] 
[126-11] 
[126-11] 
[224-14] 
[236-12] 
[1137-14] 
[11236-14] 
[11237-18] 
[111239-17] 
[1111248-19] 
[1112349-19] 
[1111234-10-22] 


12-4* 

[126] 

[26] 

10l[138] 

19[136] 

1163[1146] 

63[239] 

629[1106-11] 

479[1126-10] 

773[1126-14] 

13[1120-14] 

[1127-16] 


20-4 

[1249] 

[1238] 

63[12e-10] 

31[12g-10] 

23[124-13] 

13[13S-lli 

16i[1130.i3] 

733[H236-13] 

1163[11237-17] 

587[111238-ie] 

67[1101247-18] 

[1111049-18] 

[1111249-21] 


16-8 28 

® [113-10] [11226-11] 

1 [1129] [11126-11] 

2 13[104-11] [1126-13] 

3 43[114-11] [1136-12] 

4 3823[224-14] [236-16] 

6 1 507[226-12] [238-16] 

6 4933[il36-14] [1237-17] 

7 28943[11236-14] [11337-16] 

8 4331[1237-18] [11248-19] 

9 79[11236-17] [111249-19] 

10 43[11237-191 [1111249-21] 

11 37[11348-19] [111234-10-20] 

12 [11348-22] [1111234-10-23] 

[1112236-10-23] 

[1112246-13-26] 


122 

[248] 
[237] 
23[249] 
47[269] 
289C124-12] 
593[136-10] 
8531[1137-12] 
193[1136-12] 
929[1236- 16] 
113[1238- 15] 
37[1248-17] 
[1249-17] 
[224-10-20] 


8 * 


8*42 


[39] 

[28] 

[- 10 -] 

47[12.10] 

89[101-13] 

281[1I3.11] 

2833[I24-13] 

11[121-13] 

2213[224-17] 

2089[236-16] 

71[236-18] 

[236-18] 

[336-21] 


[28] 

[17] 

86[39j 

169[129] 

11113[104-12] 

6137[114-10] 

22703[124-12] 

9341[125-12] 

90641[226-16] 

2463[226-16] 

137[120-17] 

17[220-17] 

[226-20] 





Order 

24-4 

20*8 

20*42 

0 

[1125*11] 

[126*11] 

[124*10] 

1 

[1025*11] 

[25*11] 

[24*10] 

2 

139[1126*13] 

19[124*13] 

179[126*12] 

3 

47[il26*12] 

331[136*12] 

87[126*11] 

4 

79[226*15] 

7001[236*16] 

161[26;14] 

5 

31[137*15] 

1489[236*16J 

1601[136*14] 

6 

1051[1237*17] 

26513[1237*17] 

1609[1036*18] 

7 

79[11236*16] 

197[11127*16] 

27487[11236*14] 

8 

73[11138*19] 

30211[11247*19] 

13043[11137*18] 

9 

59[10m9*19] 

168713[111249*19] 

209609[111238*18] 

10 

5611[1111249*21] 

310841[1111249*21] 

2197[1101244*20] 

11 

8011[111234*10*20] 

127[1111330*2O] 

63653[1111249*19] 

12 

i697[1111224*10*23] 

4013[11U34> 10*23] 

1873[U11248*22] 

13 

47[1111234*10*23] 

109[111135*10-23] 

101fllll24*10*22] 

14 

[1111234*11*25] 

[111136*10*25] 

[111124*10*24] 


16*84 

-16 .43 

1234 

0 

[113»11] 

[112*10] 

[249] 

1 

[13.11] 

[12*10] 

7[249] 

2 

29[14*13] 

2U[113*12] 

211[25*11] 

3 

37[113.12] 

669[123*11] 

119[26*10] 

4 

52139[225*15] 

3671[123*14] 

3821[126*13] 

5 

9893[216*15] 

2699[116*14] 

18667[136*13] 

6 

299297[1136*17] 

29693[1115*16] 

490283[1137*16] 

7 

251104.3[11237.15] 

10G361[11215*14] 

193[1133*13] 

8 

360131[11235*19] 

624611[11136*18] 

441773[1238*17] 

9 

173497[11237*19] 

884393[11236*18] 

24799[1238*17] 

10 

589[11208*21] 

14113[10236*20] 

2647[1148*19] 

11 

29437[11348*20] 

2771[11138*19] 

677[1249*18] 

12 

3307[11348*23] 

2679[11238*22] 

2707[224*10*21] 

13 

[10249*23] 

23[11238*22] 

23[224* 10*21] 

14 

[11349*26] 

[11239*24] 

[224*11*23] 


12*842 

12*44 

884 

0 

[139] 

[128] 

[3*10] 

1 

7[139] 

7[128] 

7[3*10] 

2 

227[14*11] 

47[3*10] 

236[4*12] 

3 

267[23*10] 

36[39] 

281[13*11] 

4 

97523[126*13] 

319[110.12] 

4177[112*14] 

5 

246[6* 13] 

69971C124-12] 

643[111*U] 

G 

61219[1106*16] 

3151259[1125*14] 

98797[124*16] 

7 

9479[1026*13] 

34637[1U5*12] 

g07[114*14] 

8 

12738433[1237*17] 

1202661[1126-16] 

37161[216*18] 

9 

46223[1136*17] 

115769[1126*16] 

228603[236*18] 

10 

63693[238*19] 

9841[1126*18] 

60333[236*20] 

U 

1937[1228‘18] 

1823[1127*17] 

391[137*19] 

12 

1671[1238*21] 

167[1028*20] 

1163[336*22] 

13 

53[1239*21] 

[1117*20] 

[326*22] 

H 

[1239*23] 

[1129*22] 

[337*24] 
16 12

[124-11] 
[24-11] 
187[126-13] 
379[136-12] 
2819[234-16] 
17161[237-16] 
104507[1237-17] 
30317[11237-16] 
431099[11248-19] 
17177[11248-19] 
6613[11249-21] 
1019[1134-10-20] 
991[1234-10-23]- 
31[1235-10-23] 
[1236-11-25] 


12-82 

[14-10] 

7[l4-10] 

73[14-12] 

667[26-ll] 

22697[126-14] 

4679[116-14] 

3267831[1137-16] 

22177[1127-14] 

374281[1236-18] 

204907[1238-18] 

4783[1138-20] 

2261[1248-19] 

1277[1339-22] 

61[1349-22] 

[1349-24] 


g!43 

[29] 
7 [29] 
9[-ll] 
947[13-10] 
1811[110-13] 
27737[113-13] 
1783141[126-16] 
20627[116-13] 
1772417[226-17] 
547889[226-17] 
161331[226-19] 
127[223-18] 
2329[227-21] 
37[227-21] 
[227-23] 
Order

84^ 

4^ 

0 

[18] 

[7] 

1 

7[i8] 

7[7] 

2 

261[2-10] 

269[19] 

3 

1061[12'9] 

77[8] 

4 

182843[U3'12] 

2107[M1] 

6 

24391[103‘12j 

1603[1-U] 

6 

m741[104'14] 

29771[3-]3] 

7 

1313[4'12j 

2609[3-ll] 

8 

423697[116*16] 

29771[4-15] 

9 

64827[116-16] 

1603[3-15] 

JO 

11468[106-18] 

2107[4-17] 

11 

47[6'17] 

77[4-16] 

12 

96[106-20] 

269[6-19] 

13 

29[117*20] 

7[6-19] 

14 

[117-22] 

[7.21] 
THE ASYMPTOTICAL DISTRIBUTION OF RANGE
IN SAMPLES FROM A NORMAL POPULATION

By G. ELFVING, Helsingfors

1. Introductory. Consider a sample of n observations, taken from an infinite normal
population with the mean 0 and the standard deviation 1. Let a be the smallest and b the
greatest of the observed values. Then w = b - a is the range of the sample.

For certain statistical purposes knowledge of the sampling distribution of range is needed.
The distribution function, however, involves a rather complicated integral, whose exact
calculation is, for n > 2, impossible. Tippett (1925), E. S. Pearson (1926, 1932) and McKay
& Pearson (1933) have studied and calculated the mean, the standard deviation and the
Pearson constants $\beta_1$, $\beta_2$ of the range. Fitting appropriate Pearson curves to the distribution
by means of these parameters, Pearson (1932) has computed approximate percentage points
for it. Later on, Hartley (1942) and Hartley & Pearson (1942) have, by numerical integration,
tabulated the distribution function for n = 2, ..., 20.

As pointed out by Pearson, the distribution of range is very sensitive to departures from
normality in the tails of the parental distribution. The effect of such departures becoming
more perceptible for increasing n, the practical importance of the range distribution is,
perhaps, small for large samples. Nevertheless, it seems to be at least of theoretical interest
to investigate the asymptotical distribution of range for $n \to \infty$. This is the purpose of the
present paper.* The results are summarized in a theorem at the end of the inquiry.

2. The exact distribution. Transformations. The joint-frequency function of the extremes
a, b reads, as is well known,†

$$f_n(a, b) = n(n-1)\,\phi(a)\,\phi(b)\,[\Phi(b) - \Phi(a)]^{n-2} \qquad (2.1)$$

(cf. e.g. Cramér, 1946, p. 370). Let $u = \tfrac12(a + b)$ denote the arithmetical mean of the extreme
values of the sample. Making in (2.1) the transformation $a = u - \tfrac12 w$, $b = u + \tfrac12 w$ and
integrating with respect to u, we find for the frequency function of the range the expression

$$f_n(w) = n(n-1)\int_{-\infty}^{\infty} \phi(u - \tfrac12 w)\,\phi(u + \tfrac12 w)\,[\Phi(u + \tfrac12 w) - \Phi(u - \tfrac12 w)]^{n-2}\,du. \qquad (2.2)$$

The object of our inquiry is the limiting form of the distribution (2.2). It proves, however,
more advantageous to pass to the limit in the joint distribution of a, b or u, w, before
integrating with respect to u.

The asymptotical distribution of a and b has been investigated by Fisher & Tippett (1928),
and Gumbel (1936) (cf. also Cramér, 1946, p. 376). According to these authors, we have

$$E(u) = 0, \qquad D(u) = O(\log^{-1/2} n),$$

$$E(w) = 2\sqrt{2\log n} + \dots, \qquad D(w) = O(\log^{-1/2} n).$$

From the formulae quoted it is seen that $u \to 0$, $w \to \infty$ in probability as $n \to \infty$. Our first
task must, consequently, be a transformation of the variables a, b (or u, w) depending on
n and intended to stabilize the probability mass, in order to provide a limiting distribution.

* Prof. H. Wold has kindly directed my attention to this problem.

† $\Phi(x)$ denotes the distribution function and $\phi(x) = \Phi'(x)$ the frequency function of the normal
distribution with mean at x = 0 and unit standard deviation.
Following the example of the authors mentioned above, we should have to introduce the
new variables $a' = n\Phi(a)$, $b' = n\Phi(-b)$.

For our purpose it proves, however, advantageous to subject a' and b' to a new transformation,
independent of n, taking

$$x e^{y} = 2n\Phi(a) = 2n\Phi(-\tfrac12 w + u), \qquad x e^{-y} = 2n\Phi(-b) = 2n\Phi(-\tfrac12 w - u). \qquad (2.4)$$

Conversely,

$$x = 2n\sqrt{\Phi(a)\,\Phi(-b)} = 2n\sqrt{\Phi(-\tfrac12 w + u)\,\Phi(-\tfrac12 w - u)},$$

$$y = \tfrac12 \log\frac{\Phi(a)}{\Phi(-b)} = \tfrac12 \log\frac{\Phi(-\tfrac12 w + u)}{\Phi(-\tfrac12 w - u)}. \qquad (2.5)$$

As $a \le b$ and thus $\Phi(a) + \Phi(-b) \le 1$, it follows from (2.4) that x, y are subjected to the
restrictions

$$x > 0, \qquad x\cosh y \le n. \qquad (2.6)$$


Performing the transformation, we find

$$\left|\frac{\partial(a, b)}{\partial(x, y)}\right| = \frac{x}{2n^2\,\phi(a)\,\phi(b)}, \qquad (2.7)$$

and thus, letting $f_n(x, y)$ denote the joint-frequency function of x, y,

$$f_n(x, y) = \frac{n-1}{2n}\,x\left(1 - \frac{x\cosh y}{n}\right)^{n-2}. \qquad (2.8)$$

This formula is valid in the region (2.6); outside of it, we have to put $f_n(x, y) = 0$.

The new variables x, y depend, of course, on u as well as w. It will, however, be shown later
that x, for large n, tends to coincide with the variable

$$x^* = 2n\Phi(-\tfrac12 w),$$

which depends exclusively on w. For testing purposes, the former variable may thus, in
large samples, be used as a substitute for the range. These considerations justify the
transformation (2.4) as well as a closer study of the distribution of x and its limiting form.

3. Limit passage and remainder term. The limiting form of the joint-frequency function
(2.8) is immediately seen to be

$$f(x, y) = \tfrac12\,x\,e^{-x\cosh y}. \qquad (3.1)$$

The integral of this function, taken over the whole half-plane $x \ge 0$, is easily seen to equal 1;
(3.1) is, consequently, the frequency function of a well-determined two-dimensional
distribution.


Let the marginal distribution functions in x, corresponding to (2.8) and (3.1), be denoted
by $F_n(x)$, $F(x)$ respectively. Our next task will be to estimate the remainder
$|F_n(x) - F(x)|$, which is, obviously, at most equal to the integral

$$I = \int_0^x\!\!\int_{-\infty}^{\infty} \left|\frac{f_n}{f} - 1\right| f\,d\xi\,d\eta. \qquad (3.2)$$

To begin with, we estimate the quotient $f_n/f$ upwards. By differentiation with respect
to the variable $z = x\cosh y$, this quotient is found to attain the maximum value

$$\left(1 - \frac1n\right)\left(1 - \frac2n\right)^{n-2} e^{2}$$

for z = 2. We thus find, for example,

/, 






(3-3) 
For the further estimations, it proves necessary to divide the domain of integration in (3.2)
into an interior and an exterior part by means of a convenient abscissa $\eta = \gamma$. In order to
secure the Maclaurin expansion of $\log\left(1 - \frac{\xi}{n}\cosh\eta\right)$ within the interior region, we have to
choose $\gamma$ so as to satisfy the inequality $\frac{x\cosh\gamma}{n} \le k$ with an appropriate k < 1. Taking, for
simplicity, $k = 1 - \sqrt{1/2}$, and observing that $\cosh\gamma \le e^{\gamma}$, we see that the condition mentioned
is fulfilled if

$$e^{\gamma} \le \frac{n}{x}\left(1 - \sqrt{\tfrac12}\right). \qquad (3.4)$$

Now we may estimate $f_n/f$ downwards in the interior domain of integration. Expanding
$\log\left(1 - \frac{\xi}{n}\cosh\eta\right)$, we find

$$\log\frac{f_n}{f} = \log\left(1 - \frac1n\right) + \frac{2\xi}{n}\cosh\eta - \frac{n-2}{2n^2}\,\xi^2\cosh^2\eta\left(1 - \vartheta\,\frac{\xi\cosh\eta}{n}\right)^{-2} \quad (0 < \vartheta < 1). \qquad (3.5)$$

According to the determination of $\gamma$, the remainder factor is seen to be < 2 for $\xi \le x$, $\eta \le \gamma$.

Omitting, further, the positive term in (3-5) and 


\ n)^ 2n 


For n ^ 3, we have log 
replacing 2 by we find 




hence, combining with (3'3), 

U^,v) 

fiLv) 


| + S'‘cosh®'7 
n 


(3-6) 


In the exterior domain of integration, (3*3) directly yields 

I UE.V)-m>1 1 </(^>(3'7) 

We proceed to the estimation of the integral (3'2), denoting its interior and exterior part 
by /j and respectively. For the former-we have, according to (3*6), the inequality 


/ = rrl4-l 2/d|dw<-f* r(fg + ^®cosh25/)e-S“s''^'d^di/, 
JoJo|/ 9^Jojo 

for the latter, according to (3*7), 


n oo ’ 


(3-8). 


(3-9) 


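The integration with respect to $\xi$ may be explicitly performed. We have, in fact, putting
for brevity $\cosh\eta = \alpha$,

$$\int_0^x \xi\,e^{-\alpha\xi}\,d\xi = \frac{1}{\alpha^2}\left\{1 - e^{-\alpha x}[1 + \alpha x]\right\}, \qquad (3.10)$$

$$\int_0^x \xi^3 e^{-\alpha\xi}\,d\xi = \frac{6}{\alpha^4}\left\{1 - e^{-\alpha x}\left[1 + \alpha x + \tfrac12\alpha^2 x^2 + \tfrac16\alpha^3 x^3\right]\right\}. \qquad (3.11)$$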


In order to deduce remainder formulas for (a) moderate, (6) small x, we omit in (3*10) and 
(3-11), (a) all the negative terms, (6) the terms with and According to the Maclaurin 


expansion

$$e^{\alpha x} = 1 + \alpha x + \tfrac12\alpha^2 x^2 e^{\vartheta\alpha x} \quad (0 < \vartheta < 1),$$

the expression in curled brackets in (3.10) is at most equal to $\tfrac12\alpha^2 x^2$. Inserting these
estimations in (3.8), we obtain for the interior integral the inequalities


, 16 p dy 15. - ^15 

^ 2WJ 0 cosh ^y~2n ^ 2n ’ 

(3>12a) 

T 16 .f", ^4a:* 

4n Jo n 

(3d25) 

For the exterior integral, (3‘10) yields 



(3-13) 


Finally, we have to join the results (3-12) and (3-13). Gonabining, first, (3-12a) with (3-13) J 
and determining from (3-4) (taken with the equality sign), we obtain, after some slight 
simplifications in the numerical coefficients, 

4<-(l + ~\ {n^6). (3-14a) ' 

n\ n } 

Combining, on the other hand, (3-126) with (3-13), we find 

u 

This expression attains, for fixed x and n, its minimum when y = log^- , For ng 12, this 
value of y also satisfies (3'4), and we obtain, as a parallel estimate to (3'l4a), 

4,<^|-'(log^ + ^j (nfel2). (3-14b) 

The formulas (3‘14a, b) are both valid for all positive re and all n S 12. 


4. The asymptotical distribution. Having established the limiting distribution of the
variable x defined in (2.5), we are going to examine its properties.

The frequency function of the distribution considered reads, according to (3.1),

$$f(x) = x\int_0^{\infty} e^{-x\cosh y}\,dy. \qquad (4.1)$$

Changing the order of integration, we easily find the distribution function, the mean and
the variance of (4.1) to be

$$F(x) = \int_0^{\infty} \left\{1 - e^{-x\cosh y}[1 + x\cosh y]\right\}\frac{dy}{\cosh^2 y}, \qquad (4.1')$$

$$E(x) = \tfrac12\pi, \qquad D^2(x) = 4 - \tfrac14\pi^2. \qquad (4.2)$$

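The numerical evaluation of the distribution is much simplified by the fact that f(x) as
well as F(x) is closely connected with certain Bessel functions. Denote

$$\Psi(x) = \int_0^{\infty} e^{-x\cosh y}\,dy = \int_1^{\infty} \frac{e^{-xt}}{\sqrt{t^2 - 1}}\,dt. \qquad (4.3)$$

By differentiation and partial integration, this function is found to satisfy the differential
equation

$$\Psi''(x) + \frac1x\,\Psi'(x) - \Psi(x) = 0. \qquad (4.4)$$

Changing x into -ix, we obtain for the function $\psi(x) = \Psi(-ix)$ the equation

$$\psi''(x) + \frac1x\,\psi'(x) + \psi(x) = 0; \qquad (4.4')$$

hence, $\psi(x)$ is a Bessel function of order zero.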

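In order to specify this function, we will deduce an asymptotical expression for the
function (4.3), valid for large x. For this purpose, we make in the latter integral (4.3) the
substitution t = 1 + u/x and write

$$\Psi(x) = \frac{e^{-x}}{\sqrt{2x}}\int_0^{\infty} e^{-u}\,u^{-1/2}\left(1 + \frac{u}{2x}\right)^{-1/2} du.$$

Performing the integration, we obtain

$$\Psi(x) \sim \sqrt{\frac{\pi}{2x}}\;e^{-x}, \qquad (4.5)$$

which shows that the Bessel function $\psi(x) = \Psi(-ix)$ tends to zero for $x \to +i\infty$. This function
is, consequently, proportional to the Hankel function $H_0^{(1)}(x)$ (cf. Jahnke-Emde, 1909, p. 94).

Comparing the asymptotical expressions of $\psi(x)$ and $H_0^{(1)}(x)$, we find the proportional
factor to be $\tfrac12\pi i$, whence

$$f(x) = x\,\Psi(x) = \tfrac12\pi i\,x\,H_0^{(1)}(ix). \qquad (4.6)$$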

We proceed to the calculation of F(x). Every integral of $xH_0^{(1)}(x)$ is (cf. Jahnke-Emde,
p. 166) of the form $xH_1^{(1)}(x)$ + Const., where $H_1^{(1)}(x)$ is the first order Hankel function
corresponding to $H_0^{(1)}(x)$; consequently,

$$F(x) = \tfrac12\pi x\,H_1^{(1)}(ix) + C.$$

Now $\tfrac12\pi x\,H_1^{(1)}(ix)$ tends to zero as $-(\tfrac12\pi x)^{1/2} e^{-x}$ for $x \to \infty$ (cf. Jahnke-Emde, 1909, p. 101);
hence C = 1 and

$$F(x) = 1 + \tfrac12\pi x\,H_1^{(1)}(ix). \qquad (4.7)$$

For small x, F(x) has the expansion

$$F(x) = \left(\log\frac{p}{x} + \frac12\right)\frac{x^2}{2} + \left(\log\frac{p}{x} + \frac54\right)\frac{x^4}{16} + \dots, \qquad (4.8)$$

where

$$\log p = 0.11593\ldots. \qquad (4.9)$$

The factors of x in (4.6) and (4.7) are tabulated in Jahnke-Emde (1909, pp. 135-6).
Below, we give a short table of f(x) and F(x). The corresponding curves are seen in Fig. 1.
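
The values of f(x) and F(x) can also be checked by direct numerical integration, without Bessel tables. The following is a minimal sketch in Python, not the author's method: it evaluates (4.1) by Simpson's rule and obtains F(x) from the single integral that results from changing the order of integration in (4.1), namely the integral of {1 - e^(-x cosh y)[1 + x cosh y]} / cosh² y over y from 0 to infinity. The function names, the truncation point `upper` and the number of `steps` are illustrative assumptions, adequate for x of about 0.1 and larger.

```python
import math


def simpson(func, upper, steps):
    """Composite Simpson rule for the integral of func over [0, upper]; steps must be even."""
    h = upper / steps
    total = func(0.0) + func(upper)
    for k in range(1, steps):
        total += (4 if k % 2 else 2) * func(k * h)
    return total * h / 3


def f(x, upper=12.0, steps=4000):
    """Limiting frequency function (4.1): x times the integral of exp(-x cosh y) over y >= 0."""
    return x * simpson(lambda y: math.exp(-x * math.cosh(y)), upper, steps)


def F(x, upper=12.0, steps=4000):
    """Limiting distribution function, from (4.1) with the order of integration changed."""
    def integrand(y):
        c = math.cosh(y)
        return (1.0 - math.exp(-x * c) * (1.0 + x * c)) / (c * c)
    return simpson(integrand, upper, steps)


for x in (0.1, 0.5, 1.0, 5.0, 10.0):
    print(f"x = {x:4.1f}   f(x) = {f(x):.4f}   F(x) = {F(x):.4f}")
```

The integrands decay at least as fast as exp(-2y), so truncating at y = 12 costs far less than the four decimals of the printed table.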


5. Connexion between the variable x and the range. We now turn back to the original
object of our inquiry: the asymptotical distribution of the range.

Consider the variable

$$x = 2n\sqrt{\Phi(-\tfrac12 w + u)\,\Phi(-\tfrac12 w - u)} \qquad (5.1)$$

introduced in (2.4). As mentioned earlier,

$$w \to \infty, \quad u \to 0 \quad \text{in probability } (n \to \infty). \qquad (5.2)$$

Under such circumstances, for large n, x may be expected to behave substantially as the
variable

$$x^* = 2n\Phi(-\tfrac12 w), \qquad (5.3)$$

which depends exclusively on the range.

We shall now prove that $x^*/x \to 1$ in probability as $n \to \infty$. According to the well-known
asymptotic formula

we may, for | u | <• write , 


X 


-lain}. 


   x      f(x)      F(x)         x      f(x)      F(x)

  0.0    0.0000    0.0000       1.5    0.3207    0.5839
  0.1    0.2427    0.0146       2.0    0.2278    0.7202
  0.2    0.3505    0.0448       2.5    0.1559    0.8153
  0.3    0.4118    0.0832       3.0    0.1042    0.8795
  0.4    0.4458    0.1262       4.0    0.0446    0.9501
  0.5    0.4622    0.1718       5.0    0.0185    0.9798
  0.6    0.4665    0.2183       6.0    0.0075    0.9919
  0.7    0.4624    0.2648       7.0    0.0030    0.9968
  0.8    0.4522    0.3106       8.0    0.0012    0.9988
  0.9    0.4380    0.3552       9.0    0.0005    0.9995
  1.0    0.4210    0.3981      10.0    0.0002    0.9998



Given an arbitrary e>0, we obviously 


such that 


X* 

X 


<e 


may find two positive numbers Ug and 
if w>«;^, 


(>«J 

(5-4) 


On account of (6-2), we may, on the other hand, choose so that the probability of the 
simultaneous validity of the latter inequalities in (6‘4) exceeds l—eifn^ng. Consequently, 



I 



> 1 —e 




which proves our statement, 


( 6 - 6 ) 
As shown in section 3, the distribution function $F_n(x)$ of x converges to F(x) as $n \to \infty$.
Since F(0) = 0, it follows from (5.5), by a well-known method of argument, that the
distribution function $F_n^*(x)$ of x* converges to the same limiting function. The asymptotical
distribution of the range, suitably transformed, is hereby established.

For practical purposes, it would, of course, be desirable to possess a reasonably accurate
estimate of the remainder $F_n^*(x) - F(x)$, or at least an estimate of the difference $F_n^*(x) - F_n(x)$,
to be combined with the results (3.14).

For n = 20, the accuracy of F(x) as substitute for $F_n^*(x)$ may be checked by means of
Hartley's (1942) tables. The discrepancy amounts to about 0.004 for x = 0.1, 0.026 for x = 1,
and 0.010 for x = 4.
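
The size of this discrepancy can also be explored by simulation. The sketch below is an illustration added here, not a computation from the paper: it draws samples of n = 20 from a standard normal population, forms x and x* as defined in (2.4) and (5.3), and compares their empirical distribution functions at x = 1 with the tabulated limit F(1) = 0.3981. The function names, the seed and the number of trials are arbitrary assumptions.

```python
import math
import random
from statistics import NormalDist

STD_NORMAL = NormalDist()      # standard normal: supplies the distribution function Phi
F_LIMIT_AT_1 = 0.3981          # limiting value F(1), taken from the paper's table


def transformed_range_sample(n, rng):
    """Draw one normal sample of size n and return the pair (x, x_star)."""
    values = [rng.gauss(0.0, 1.0) for _ in range(n)]
    a, b = min(values), max(values)
    x = 2 * n * math.sqrt(STD_NORMAL.cdf(a) * STD_NORMAL.cdf(-b))
    x_star = 2 * n * STD_NORMAL.cdf(-(b - a) / 2)
    return x, x_star


def empirical_cdfs(n, point, trials, seed=0):
    """Empirical P(x <= point) and P(x_star <= point) over the given number of samples."""
    rng = random.Random(seed)
    below_x = below_x_star = 0
    for _ in range(trials):
        x, x_star = transformed_range_sample(n, rng)
        below_x += x <= point
        below_x_star += x_star <= point
    return below_x / trials, below_x_star / trials


fn, fn_star = empirical_cdfs(n=20, point=1.0, trials=20000)
print(f"F_n(1)  ~ {fn:.3f}")
print(f"F*_n(1) ~ {fn_star:.3f}")
print(f"F(1)    = {F_LIMIT_AT_1:.4f}")
```

With 20000 trials the sampling error of each empirical value is roughly 0.0035, so the simulation can only indicate the order of magnitude of the discrepancy, not reproduce Hartley's figures.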

The theoretical evaluation of $F_n^*(x) - F(x)$ seems to be somewhat complicated and, besides,
of little use since x*, for most purposes, may be replaced by x. A few remarks concerning
the relations between x, x* and their distribution functions will, however, be added below.

To begin with, we note that always $x \le x^*$, the equality sign being valid only if u = 0.
Consider, in fact, the function x(u), defined by (5.1) for a fixed w. Inserting for $\Phi$ its analytical
expression, we easily find that $D_u^2 \log x(u) \le 0$ for all u. Hence, x(u) has no minimum and
at most one maximum, and the latter is, by symmetry, seen to be attained for u = 0, being
thus equal to x*.

From $x \le x^*$, it follows that $F_n^*(x) \le F_n(x)$ for all x. We will show that the difference
$F_n(x) - F_n^*(x)$ may be expressed as a double integral.

The variables u and w are, according to (2.4), well-determined
functions of x and y in the region (2.6); and so is the variable x*,
on account of (5.3).

On the level curve x* = Xq^ w has a constant value wq) determined 
2%<5(-itDo) = a:o, (6-6) 

and this curve is, consequently, given in parametric form by the 
equations 

where u runs through all values from -oo to +oo. The latter 
function (6-7) being, obviously, monotonously increasing, we may 
imagine u eliminated, writing (6-7) in the form 

^ = ^ni^o, y) (- 00 < y < oo). (6-7') 

From the proof of the inequality x < x* given above, it follows 
that the function (5-7') has a single maximum for y = 0. When 
y ^ + cX), the function obviously tends to zero. 

The inequality x* g oJq is fulfilled on the left side of the curve (6*7'), the mequa,lity 
on the left side of the straight line X = Xg. Let us for brevity denote the regions (cf. fig. 2) 



OgxgiJXo,y), Ux:,y)<x<Xo ( 6 * 8 ) 

by Ajxg) and ^Jxg) respectively. The difference Fjxg)-F^[xg) is, then, the probability 
of the points x, y falling within the region Dropping the indices 0, we thus obtain 

the expression sought for r r 

i;(a:)-F*(a;) = 

J J £>n{i 


'«(») 




(6*9) 
Comparing, finally, the transformed range distribution function $F_n^*(x)$ directly with its
limiting form F(x), we find

n(a;) - m = [w - ~ iw - 


tx J J 

= ff (fn-f)dEdv-f\ fd^dy. (6.10) 


The former integral is, obviously, at most equal to the remainder expression in (3-2), 
estimated in (3T4). 


6. Conclusion. Our main results may be summarized in the following theorem:

THEOREM. Consider a sample of n observations from an infinite normal population with
mean 0 and standard deviation 1. Let a be the smallest, b the greatest of the observed values,
and put

$$x = 2n\sqrt{\Phi(a)\,\Phi(-b)}, \qquad x^* = 2n\Phi\left(-\frac{b - a}{2}\right),$$

the latter variable being evidently a simple transformation of the range of the sample. Then

(1) $x \le x^*$; $x^*/x \to 1$ in probability ($n \to \infty$).

(2) The distribution functions $F_n(x)$ and $F_n^*(x)$ of x and x* tend, for $n \to \infty$, to the
common limit

$$F(x) = 1 + \tfrac12\pi x\,H_1^{(1)}(ix),$$

where Hfl\z) is the first order Bessel function, which vanishes as — (^ 1 for z-^ + ioo- 

\2iJ 

(3) For 71 ^ 12, satisfies the inequalities 


\FAx)~F(x)\<l{l + ^-fj, 



7. Generalization. A great part of our conclusions does not presuppose the normality
of the parental population. Thus, the distribution (2.8) of the variables x, y defined by (2.5)
is the same for any continuous probability law and so, consequently, is its limiting form;
however, if the parental distribution is non-symmetrical, with distribution function G(x),
say, the factor $\Phi(-b)$ in (2.5) must, of course, be replaced by 1 - G(b) instead of G(-b),
and the variable x* is to be defined by

$$x^* = 2n\sqrt{G(-\tfrac12 w)\,[1 - G(\tfrac12 w)]}.$$

The proof of the statement $x^*/x \to 1$ requires, however, convenient assumptions
concerning the parental distribution. It can be proved that the assertion mentioned, and,
consequently, the theorem stated above, are valid if the frequency function of this
distribution is of the form

9 (x) = Oexpj^-l|a:|»J, 

where l<p^2. 
LIMITS OF THE RATIO OF MEAN RANGE TO
STANDARD DEVIATION*

By R. L. PLACKETT, B.A.

The ratio of mean range in samples of n to population standard deviation $\sigma$, which
has been denoted by $d_n$, is used in control chart work (when the population is assumed
normal) to estimate $\sigma$ from the ranges of a set of small samples. On comparing the series of
values of $d_n$ for different n when the parent population is rectangular with the series when
it is normal (see table below), it is clear that for n < 12 the two series agree to within less
than 10%. With this in mind, the question arises: what are the limiting values of $d_n$ for
a given n? It is shown here that populations exist for which $d_n$ is arbitrarily near to zero,
while for no population will $d_n$ exceed the value

$$n\sqrt{\frac{2}{(2n-1)!}\left\{(2n-2)! - [(n-1)!]^2\right\}}.$$

We consider a population whose distribution function is F(x) and which extends from
-a to +a so that F(-a) = 0 and F(a) = 1. The population in the first place may have
any finite limits, but there is no loss in generality in supposing these. It is required to find
limits to the ratio

$$d_n = \frac{\int_{-a}^{a} [1 - F^n - (1 - F)^n]\,dx}{\sqrt{\int_{-a}^{a} x^2\,dF - \left(\int_{-a}^{a} x\,dF\right)^2}}. \qquad (1)$$

We apply the calculus of variations and find the extremes of $d_n$ in the class of functions F
such that F(-a) = 0 and F(a) = 1; the case is thus one of fixed end-points. Suppose that
F(x) = u(x) gives an extreme value and form the functions F(x) = u(x) + tv(x); for t suitably
near to zero, all these will be permissible distribution functions, i.e. monotonically increasing,
provided v(-a) = v(a) = 0. Then for t = 0, $\partial d_n/\partial t$ is zero for all functions v(x).

j [1 — (li 4* tv)'*^ — (1 — ^ — tv)'^] dx 

Since ---^, 

Q x^(u' x(u' J* 


r ra / ra \2”ii 

2 J x^u'dx — ^ xu^dxj 


ra na 

Now xh dx = —2\ XV dx since t;(a) = t;( — a) = 0, and by the same condition 

J -a J -a 

j xFdx = ~ [ vdx. 

J-a J-a 

* Communication from the National Physical Laboratory.
The numerator now becomes of the form $\int_{-a}^{a} s(x)\,v(x)\,dx$, and this must be zero for all
functions v(x); it is therefore concluded that s(x) is identically equal to zero. In fact
“ m \ 2“1 

n xH'dx— xu'dxj [(1— 

~a \J -a / J 

= (l—u'^‘'-'(l — uy^}dx xu^dx — x , 

-J -ft j LJ -«■ J 

so that if $\mu$ is the mean, $\sigma$ the standard deviation, $w_n$ the mean range in samples of n, and
F(x) the distribution function of the population which gives an extreme value to $d_n$, we have

$$w_n(x - \mu) = n\sigma^2[F^{n-1} - (1 - F)^{n-1}].$$

Put x = -a and obtain $n\sigma^2 = w_n(\mu + a)$; x = a gives $n\sigma^2 = w_n(a - \mu)$, whence $\mu = 0$ and

$$x = a[F^{n-1} - (1 - F)^{n-1}]. \qquad (2)$$

This distribution must give an upper limit to since if we consider a distribution of the 
tjpe below: 

Area(l-2i/) F(x):=:{2x’j~l)y — 

Area y Area y F{x) = 1 — (1 — 2a;)^ 

— \ 0 -hi 

the ratio (1) for y — 0{n-^) is approximately ^/(3/2)%^^/ which can be made as small as we 
please. 

Reverting therefore to equation (2) we note that since $a w_n = n\sigma^2$, $d_n(\text{max.}) = n\sigma/a$. Now

$$\sigma^2 = \int_{-a}^{a} x^2\,dF - \left(\int_{-a}^{a} x\,dF\right)^2 = a^2\int_0^1 [F^{n-1} - (1 - F)^{n-1}]^2\,dF.$$

Therefore

$$\sigma^2/a^2 = \frac{2}{2n-1} - 2B(n, n),$$

i.e.

$$d_n(\text{max.}) = n\sqrt{\frac{2}{(2n-1)!}\left\{(2n-2)! - [(n-1)!]^2\right\}}. \qquad (3)$$

It is of interest to note that all the foregoing analysis may be carried out with a equal to
any finite value and so we may take the limit as $a \to \infty$, and equation (3), which is independent
of a, will still hold.

It is easy to verify, by Stirling's formula or otherwise, that as n increases $[(n-1)!]^2$
becomes negligible compared with (2n - 2)!.

Consequently, for large n, $d_n(\text{max.}) \to n\sqrt{2/(2n-1)}$. Therefore

$$d_n(\text{max.}) \approx \sqrt{n + \tfrac12}. \qquad (4)$$

The probability density function of (2) is obtained by differentiation and is

$$f(x) = \frac{1}{a(n-1)[F^{n-2} + (1 - F)^{n-2}]}, \qquad (5)$$

so that (2) and (5) are the parametric equations of the curve in terms of its distribution
function. Thus for n > 2, $f(0) = 2^{n-3}/[a(n-1)]$ and $f(\pm a) = 1/[a(n-1)]$. The distributions (2) are readily
seen to be unimodal and symmetrical about x = 0. For n = 2, 3 they are rectangular. For

X • 1 

and large n, Hence considerations for 

F < I show that for large n and x^O, 

a: I (w — 1)’ 

From (4), $\sigma \sim a/\sqrt{n}$. Consequently, for any finite a, as $n \to \infty$ the distributions (2) tend to
a single ordinate at x = 0. This should be compared with the limiting case giving $d_n \to 0$
for fixed n illustrated with the diagram above. The limiting form of the two distributions is
the same but the approach to the limit with increasing n is quite different. There is no
approach to normality.

Following is a table of $d_n$(max.) and of $d_n$ in samples from normal and rectangular
populations for n = 2, ..., 12. The quantity $\sqrt{n + \tfrac12}$ is also included to see how closely (4) is
approximated. The values of $d_n$ (normal) are obtained from the paper by E. S. Pearson
(1942). For a rectangular distribution $d_n$ is simply $2\sqrt3\,(n-1)/(n+1)$.


  n    \sqrt{n+1/2}    d_n (max.)    d_n (normal)    d_n (rectangular)

  2      1.58114        1.15470         1.128            1.15470
  3      1.87083        1.73205         1.693            1.73205
  4      2.12132        2.08395         2.059            2.07846
  5      2.34521        2.34013         2.326            2.30940
  6      2.54951        2.55333         2.534            2.47436
  7      2.73861        2.74414         2.704            2.59808
  8      2.91548        2.92076         2.847            2.69430
  9      3.08221        3.08685         2.970            2.77128
 10      3.24037        3.24440         3.078            2.83426
 11      3.39116        3.39466         3.173            2.88675
 12      3.53553        3.53860         3.258            2.93116
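
The first, second and fourth columns of the table follow directly from formula (3), the approximation (4) and the rectangular formula quoted above, so they can be recomputed as a check. The sketch below is an added illustration with hypothetical function names; the normal-population column comes from Pearson's tables and is not recomputed here.

```python
import math


def d_max(n):
    """Upper limit (3): n * sqrt(2 * {(2n-2)! - [(n-1)!]^2} / (2n-1)!)."""
    inner = math.factorial(2 * n - 2) - math.factorial(n - 1) ** 2
    return n * math.sqrt(2 * inner / math.factorial(2 * n - 1))


def d_max_approx(n):
    """Large-sample approximation (4): sqrt(n + 1/2)."""
    return math.sqrt(n + 0.5)


def d_rectangular(n):
    """Ratio of mean range to standard deviation for a rectangular population."""
    return 2 * math.sqrt(3) * (n - 1) / (n + 1)


print(" n   sqrt(n+1/2)   d_n(max.)   d_n(rect.)")
for n in range(2, 13):
    print(f"{n:2d}   {d_max_approx(n):10.5f}   {d_max(n):9.5f}   {d_rectangular(n):9.5f}")
```

For n = 2 and n = 3 the extremal distribution (2) is itself rectangular, which is why the two columns coincide in those rows.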


Some values of $d_n$ for a number of symmetrical populations were given by Pearson
& Adyanthaya (1928) and have been reproduced with some figures for one skew population
in Tables for Statisticians and Biometricians, Part II, Table XXIII. The majority of these
values were obtained empirically from random sampling experiments. These values were
of course subject to sampling error and for this reason are in three cases very slightly
above $d_n$(max.).


Some of the preceding work was done as part of the Research and Development programme 
of the Ministry of Supply (S.R. 17) and appears by permission of the Chief Scientific Officer. 
It was completed as part of the research programme of the National Physical Laboratory, 
and this paper is published by permission of the Director of the Laboratory. 
SIGNIFICANCE TESTS FOR 2 x 2 TABLES
By G. A. BARNARD, Imperial College
Part I 

The theory of statistical significance tests deals with abstractions of experimental results.
The fact that the figures dealt with may happen to be tensile strengths of iron bars, or 
perhaps weights of babies, is ignored in the carrying out of the test; and for the purpose of 
statistical theory the experiment in question could just as well be represented by an experi¬ 
ment involving the drawing of balls from urns. In fact, it is an advantage, from some points 
of view, to replace the concrete experiment involved in a particular practical case by an 
‘abstract’ urn-experiment, in order to retain in view only those features of the case which 
can be dealt with by statistical methods. 

It is obvious enough that the first step in the statistical treatment of an experimental 
result may be represented as the replacement of the concrete experiment by an ‘urn- 
experiment’; but the implications of this have not always had the continuous attention they 
deserve. Once the abstract picture has been formed, the analysis of it is largely a matter 
of pure mathematics. What distinguishes the statistician from the pure mathematician, in 
this connexion, should be the statistician’s ability to form valid abstract pictures of concrete 
cases, and his clear recognition of the limits of validity of his abstract pictures. Yet we find 
relatively little discussion in statistical text-books of the process of formation of these
abstract pictures. 

It is the purpose of the first part of this paper to draw attention to the confusion which 
may arise through the possible formation of several different abstract pictures, each of 
which may apply to some concrete cases, though not to others. 

Suppose we are given two mass-production processes, A and B, and we wish to test 
whether process A and process B are equally satisfactory, in the sense that neither process 
is more likely to produce defective items than the other. For this purpose we take, say, 
m articles made by process A, and n made by process B, and test them, under suitable
conditions. We find that a out of the m articles are defective, while b out of the n articles are
defective, a result which can be represented in the form of a 2 x 2 table (Table 1).


Table 1

               I (defective)    II (non-defective)    Total

Process A            a                  c               m
Process B            b                  d               n

Total                r                  s               N


The statistical analysis of results of this type has been much discussed, but it seems to 
have escaped notice that, on the facts incompletely stated as above, it is possible to form
several different abstract pictures, any one of which might be appropriate to the real case 
in question. The adoption of one picture rather than another will depend, in a given case, 
on further knowledge which is not specified above. 
The basis of Fisher’s ‘exact’ test

The current generally accepted test for results of the above type is that given by Fisher
(1941), or some approximation to it. The simplest abstract picture* to which this test
corresponds would seem to be one in which the m articles made by process A and the n articles
made by process B are represented by N similar balls, m of them marked A and n marked B. 
The N balls are put into an urn, and then withdrawn in random order. As they are withdrawn, 
the balls are placed, in order, in a row of N receptacles, r of which have been marked ‘I’, 
the remainder being marked ‘II’. The result of Table 1 then represents the observation that 
a of the balls marked A are in receptacles marked ‘I’. The probability of such a result, in 
such an experiment is

$$\frac{m!\,n!\,r!\,s!}{N!\,a!\,b!\,c!\,d!}, \qquad (1)$$

which can be seen by considering that the contents of the r receptacles marked ‘I’ form a
sample of r from an urn containing m balls marked A and n balls marked B, the sampling 
being done without replacement. The probability (1), added to those of all results less 
probable than that obtained, is the basis of Fisher’s test. 
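
The calculation just described can be sketched in a few lines. The Python below is an illustration rather than Fisher's own procedure: the function names are hypothetical, exact fractions are used to avoid rounding when probabilities are compared, and results exactly as probable as the one observed are counted together with the less probable ones, which is an assumption about how ties are treated.

```python
from fractions import Fraction
from math import comb


def table_probability(a, b, c, d):
    """Probability (1) of the table with fixed margins: m! n! r! s! / (N! a! b! c! d!)."""
    m, n, r = a + c, b + d, a + b
    return Fraction(comb(m, a) * comb(n, b), comb(m + n, r))


def fisher_p_value(a, b, c, d):
    """Sum of (1) over all tables with the same margins that are no more probable than the observed one."""
    m, n, r = a + c, b + d, a + b
    observed = table_probability(a, b, c, d)
    total = Fraction(0)
    for a_alt in range(max(0, r - n), min(m, r) + 1):
        b_alt = r - a_alt
        p = table_probability(a_alt, b_alt, m - a_alt, n - b_alt)
        if p <= observed:
            total += p
    return total


# Hypothetical example: 3 of 4 articles from process A defective, 1 of 4 from process B.
print(table_probability(3, 1, 1, 3))   # probability (1) of the observed table
print(fisher_p_value(3, 1, 1, 3))      # significance level under the convention above
```

With m = n = 4 and r = 4 the five possible tables have probabilities 1/70, 16/70, 36/70, 16/70 and 1/70, so the observed table has probability 8/35 and the sum over tables no more probable than it is 17/35.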

In the concrete case given, the N balls, initially similar, may be taken to correspond with
the N items of raw materials. The process of labelling the balls A and B corresponds to the
selection of m of the items of raw material, and their fabrication into articles by process A,
and the fabrication of the n remaining ones by process B. The N receptacles into which the
balls are eventually placed then represent the N ‘test occasions’ which must be provided
for when the experiment is laid out. The fact that these receptacles are labelled ‘I’ or ‘II’
before the balls are placed in them corresponds to the assumption of the hypothesis being
tested: that the processes do not differ in respect of liability to defectives, so that whether
or not a given article is defective has nothing to do with whether it is A or B. The labelling
‘I’ or ‘II’ is thus assumed independent of the labelling of the balls. Finally, the random
allocation of balls to receptacles corresponds to a precaution which might have been taken in
the concrete case, viz. the random order of test of the article secured by the use of random
numbers or the like.

The basis of the C.S.M. test

Another abstract picture, also applicable to the concrete case as incompletely described
above, forms the basis of the test to be developed in the later part of this paper, which we
have called the C.S.M. test. In this picture, the two processes A and B are represented by two
urns, A and B, each urn containing a large number of balls, some of which are marked ‘I’,
while the others are marked ‘II’. The selection for test of m articles of process A is represented
by the random drawing of m balls from urn A; and similarly for the n articles of process B.
The test procedure corresponds to the examination of the balls, to see whether they are
marked ‘I’ or ‘II’. The liability of process A to produce defectives is represented by the
proportion $p_1$ of balls marked ‘I’ in urn A, while $p_2$ similarly represents the liability of
process B. The hypothesis we wish to test says that $p_1 = p_2 = p$, say. The probability of a
result such as that of Table 1 is very nearly

$$\frac{m!\,n!}{a!\,b!\,c!\,d!}\,p_1^{a}(1 - p_1)^{c}\,p_2^{b}(1 - p_2)^{d}, \qquad (2)$$

* Though not the only possible one. By following Fisher’s argument, as given in his book, one can
construct a more complicated picture which leads to a similar result.
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which, on the hypothesis tested, becomes 


min! 

aTblclH 




We may notice that the expression (3) differs from (1) by a factor 


11 

rU! 




(3) 


and it would have been obtained in the earlier case if we had assumed that the labelling of 
the receptacles was itself done randomly, by selection of N labels from a box containing a 
large number of labels, the proportion marked ‘ I ’ being p. 

To justify the application of our second picture to a concrete case, we should have to be 
satisfied that the conditions of process A and those of process B were sufficiently stable, in 
a statistical sense, to justify the formation of the notions corresponding to p_a and p_b. We 
should further have to make sure that our selection of samples of m and n respectively was 
for practical purposes random. And finally, we should have to be reasonably sure that the 
conditions of test themselves had practically no influence on the results of the test—that the 
test used revealed a real property of the article tested, rather than a property of the 
individual conditions of test. 


Another type of abstract experiment 

Another case of common occurrence may be represented by a single urn, containing balls 
each of which carries two marks—one mark being either A or B, the other mark being 
either ‘I’ or ‘II’. The experiment consists in drawing N balls from the urn, at random, and 
examining their markings. If the proportion of balls marked ‘AI’ is p_{a1}, while p_{a2}, p_{b1}, p_{b2} 
similarly represent the proportions of the other markings in the urn, the probability 
associated with Table 1 in this case is 

\[ \frac{N!}{a!\,b!\,c!\,d!}\; p_{a1}^{a}\, p_{a2}^{c}\, p_{b1}^{b}\, p_{b2}^{d} \]

by the multinomial theorem, provided the number of balls in the urn is large. In this case 
the hypothesis tested, that the markings ‘I’ and ‘II’ on the one hand, and the markings A 
and B on the other, are independent, may be put in the form 

and, assuming that (p_{a1} + p_{a2}) = p′ and (p_{b1} + p_{b2}) = 1 − p′, and (p_{a1} + p_{b1}) = p and 
(p_{a2} + p_{b2}) = 1 − p do not vanish, the probability of our result, on the hypothesis tested, 
can be expressed as 

\[ \frac{N!}{a!\,b!\,c!\,d!}\; p'^{\,m}(1-p')^{n}\; p^{a+b}(1-p)^{c+d}, \tag{5} \]

which differs from (3) by a factor 

\[ \frac{N!}{m!\,n!}\; p'^{\,m}(1-p')^{n}. \]

This shows that (5) is related to (3) in much the same way as (3) is related to (1). 
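
The relation between the single-urn (multinomial) probability and the two-urn probability can be checked numerically. The following is a minimal sketch, assuming the cell layout a, c for process A and b, d for process B, and assuming that independence means each cell proportion is the product of its marginal proportions; the function names and the numerical values of p and p′ are illustrative only and do not come from the paper.

```python
from math import factorial, isclose

def table_coeff(a, b, c, d, numerator):
    """Return numerator / (a! b! c! d!) as an exact integer."""
    return numerator // (factorial(a) * factorial(b) * factorial(c) * factorial(d))

def multinomial_prob(a, b, c, d, p_a1, p_a2, p_b1, p_b2):
    """Single-urn case: N balls drawn, each carrying one of four markings."""
    n_total = a + b + c + d
    coeff = table_coeff(a, b, c, d, factorial(n_total))
    return coeff * p_a1**a * p_a2**c * p_b1**b * p_b2**d

def two_urn_prob(a, b, c, d, p):
    """Two-urn case on the hypothesis p_a = p_b = p (m and n fixed)."""
    m, n = a + c, b + d
    coeff = table_coeff(a, b, c, d, factorial(m) * factorial(n))
    return coeff * p**(a + b) * (1 - p)**(c + d)

def margin_factor(m, n, p_prime):
    """Binomial probability that m of the N balls are marked A."""
    return factorial(m + n) // (factorial(m) * factorial(n)) * p_prime**m * (1 - p_prime)**n

# Illustrative values (assumed): the table of Table 2 and arbitrary p, p'.
a, b, c, d = 2, 5, 6, 1
p, p_prime = 0.3, 0.6

single_urn = multinomial_prob(
    a, b, c, d,
    p_prime * p, p_prime * (1 - p),            # markings AI, AII
    (1 - p_prime) * p, (1 - p_prime) * (1 - p) # markings BI, BII
)
product = two_urn_prob(a, b, c, d, p) * margin_factor(a + c, b + d, p_prime)
print(single_urn, product)
```

The two printed numbers agree to rounding error, which is the sense in which the single-urn probability is the two-urn probability multiplied by the chance of obtaining the margins m and n.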

This situation could present itself in our concrete case if the articles made by the two 
processes A and B were mixed up together in a common store, and the test sample of N 
were randomly drawn from this store, the subsequent conditions being as in the second case. 
Statisticians with industrial experience may perhaps feel it is unlikely that the experiment 
would be performed in this way; but it must be admitted that it could have been. Cases 
such as this seem to occur more frequently in biometric investigations, where a population 
of animals is being tested for the association or otherwise of two characters. 

Nomenclature 

The name ‘double dichotomy’ has been applied generally to all experiments leading to 
results of the form of Table 1, but the foregoing analysis would suggest that it might be more 
appropriate to restrict this term to the third case we have indicated. Since the second case 
can be obtained from the third by supposing the numbers of articles made by process A 
and by process B to be fixed, we might then call the second case the (singly) restricted double 
dichotomy. Similarly, the first case would be called the doubly restricted double dichotomy. 
Such a nomenclature, apart from a lack of euphony, would be open to the objection that it 
would tend to imply that the third case was the general one, the first two being derivatives 
of it. This, in turn, would imply that the subject-matter of our investigation in cases one and 
two was in reality a four-fold universe, the restrictions on numbers being merely matters of 
experimental technique. But such is not always the case. The question implied in our second 
case presupposes two two-fold populations, which are to be compared, and no four-fold 
super-population need exist for this question to have meaning. 

We therefore propose the names ‘double dichotomy’ for the third case, ‘2 × 2 comparative 
trial’ for the second case, and ‘2 × 2 independence trial’ for the first case, though here again 
an objection on aesthetic grounds would be easy to sustain. 

Finer distinctions 

In principle it could be maintained that there is a distinction between the 2 × 2 comparative 
trial, as instanced above, and a restricted double dichotomy. As we have said, the fundamental 
subject-matter of a 2 × 2 comparative trial is a pair of populations; while the subject-matter 
of a restricted double dichotomy is a four-fold population from which we happen, by 
an accident of experimental technique, to be able to extract samples in which the numbers 
of items having certain characteristics are fixed. The latter case could arise, for example, 
if an attempt was being made to discover association between colour of eyes in school-children 
and some less easily identified characteristic, such as membership of a particular 
blood-group. We could imagine that an experimenter might pick out m children with (say) 
blue eyes, and n without blue eyes, and then, having obtained his samples, he might subject 
them to a test for blood-group. The conclusions drawn from such an experiment would 
presumably be intended to apply to the population of school-children, a four-fold one relative 
to the two characteristics in question. The distinction between the two cases comes out if 
we consider what happens if, in the 2 × 2 comparative trial, all items tested turn out to be 
defective. In this case we should say that our question, whether p_a = p_b or not, tends to be 
answered in the affirmative. In the case of the school-children, if they all turn out to have 
the same blood-group, then no conclusion on our question about the four-fold population 
can be drawn at all. 

Similar distinctions apply to the 2 × 2 independence trial. In the psycho-physical experiment 
described by Fisher (1942), where the point at issue is whether or not a lady can tell 
whether the milk or the tea has been put in the cup first, no statistical population is 
presupposed. The question would have meaning even if we refused to regard the order of 
insertion of milk or tea as ever being a matter of chance, while at the same time we regarded 
the lady’s guess as equally determinate. The ‘statistical population’ enters into this experiment 
only in the experimental technique, via the randomization procedure used to fix the 
order of presentation of cups; it does not enter into the question being asked. In this case, 
the extreme result, in which in fact the milk was put in first every time, while the lady 
guessed every time that it was otherwise, would be taken as evidence against the lady’s 
claim. But such a result could by itself have no meaning for the question asked in the case 
of a restricted 2 × 2 trial or a doubly restricted double dichotomy. 

Further types of experimental procedure leading to results expressible in the form of 
Table 1 are the various sequential procedures that have been described for deciding questions 
of the kind we have been discussing (3, 4). Yet another procedure is one where the conditions 
of trial vary from one block of tests to another—as when an open-air trial runs over several 
days of inconstant weather. Here we might suppose there were k pairs of urns, 
(A_1, B_1), (A_2, B_2), ..., (A_k, B_k). The distinctions here are, however, obvious enough, and they are 
worth noting only in order to emphasize that the mere fact that results are presented in the 
form of Table 1 is not in itself sufficient to specify an appropriate test of significance. 

Part II 

The significance test for the 2 × 2 trial 

Roughly speaking, the object of a significance test as applied to results of the type 
considered, is to answer the question: Can these results be ascribed to ‘chance’? In this form, 
the question is not sufficiently precise. If our ‘urn model’ for the 2 × 2 comparative trial is 
adequate to represent the experiment actually carried out, then the results will in any case 
be ‘due to chance’, in some sense. What we wish to know in this case is whether a particular 
kind of chance—namely, one in which p_a = p_b—can account for our results. 

If the results are such that this explanation of them is untenable, then we may conclude 
either that our particular ‘urn model’ of the experiment is inadequate anyway; or we may 
retain the model, and conclude that p_a and p_b must be unequal. In most cases, of course, 
we shall reach the latter conclusion, since we would not have made up the urn model in 
question unless we had some reasons for believing in its adequacy; but it is well to bear in 
mind the first alternative, in case a re-examination of the circumstances may make us change 
our minds. A point very strongly emphasized by Fisher in his book The Design of Experiments 
is that we ought to have in mind a particular ‘urn model’ before the experiment is 
performed, and arrange the conduct of the experiment so that the adequacy of this urn 
model is not likely to be questioned afterwards. 

With the qualifications indicated, we can say that the object of the significance test we 
propose to develop is, to enable a particular class of explanations of our experimental results 
to be ruled out as untenable. Specifically, given results like those of Table 1, we want to be 
able to say that they could not be accounted for by supposing that the experiment we 
actually performed was analogous to the urn experiment with two urns in which p_a = p_b = p. 
This raises the question, in what sense could such a supposition fail to account for the 
observed results ? Any result of the form of Table 1 could arise in an experiment of this kind, 
when our supposition is true. Why, then, should we select some results of this form and say 
they are incompatible with our supposition? 

In the last analysis, this question cannot be answered without an examination of what is 
meant in general by statements involving probabilities, a point which is still the subject of 
controversy. But in our particular case (if not in all cases) we can avoid giving a general 
answer to the question of what probability is, by considering the practical circumstances 
which form the setting for our particular problem, and the uses to which we propose to put 
the answer. In fact, in our case we are interested in the equality or otherwise of p_a and p_b 
because we want to decide which of the two processes, A and B, is to be preferred, from the 
point of view of defectives produced. To say that p_a is greater than p_b will mean, for us, that 
process B is preferable, and conversely if p_b is greater than p_a, while to say that p_a and p_b 
are equal will mean that there is nothing to choose between the two processes. In fact, to 
say that p_a = p_b, in our case, means that, if process A and process B are both used, then it 
will be found that the frequencies with which defectives appear in the two processes will, 
for practical purposes, be equal.* Thus we shall assert that results in which the observed 
frequencies, a/m and b/n, differ widely, are incompatible with the supposition that p_a = p_b; 
in doing so, we shall be neglecting as impossible a class of events which are in reality logically 
possible, but whose probability is small. The precise formulation of a test of significance 
then reduces to a precise formulation of what is meant by a ‘wide difference’ in the 
frequencies a/m and b/n, and to an evaluation of the probability of those events which are being 
neglected as impossible. 

The lattice diagram 

If we consider the first problem, of arranging results like those of Table 1 in order of the 
relative ‘ width ’ of the differences they indicate, a first step is the enumeration of all possible 
results in a convenient form. 

Logically, we should begin by noting that Table 1 is really an abbreviated version of the 
results of any one particular experiment, which will to start with be like those of Table 2 
(where we have taken m = 8, n = 6, for definiteness). 

Table 2 

Urn:   A   A   A   A   A   A   A   A   B   B   B   B   B   B 
Mark:  II  I   II  II  I   II  II  II  I   I   II  I   I   I 

But if, as we are presupposing, our urn analogy is adequate to represent the conditions of 
the experiment, the order in which the results were obtained must be irrelevant to the 
interpretation of results. If the conditions of trial varied during the course of the experiment, 
this assumption might not be correct—for example, if the trial were an open-air trial, and 
it began to rain half-way through. We are assuming that the urn analogy is adequate, and 
so we must treat all results like Table 2 which give the same values to a, b, c, d, in Table 1, 
as equivalent. Table 1 therefore stands for m! n!/(a! b! c! d!) distinct, but equivalent, results 
which we shall not distinguish from now on. 
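
As a worked instance of this count, the sketch below tallies the sequence of Table 2 into the four cell frequencies and evaluates m! n!/(a! b! c! d!). The two strings simply transcribe Table 2 (with m = 8, n = 6); the variable names are illustrative.

```python
from math import factorial

# Table 2, transcribed: which urn each ball came from, and its mark.
urns = "A A A A A A A A B B B B B B".split()
marks = "II I II II I II II II I I II I I I".split()

# Cell frequencies of Table 1: a, c for urn A and b, d for urn B.
a = sum(1 for u, k in zip(urns, marks) if u == "A" and k == "I")
c = sum(1 for u, k in zip(urns, marks) if u == "A" and k == "II")
b = sum(1 for u, k in zip(urns, marks) if u == "B" and k == "I")
d = sum(1 for u, k in zip(urns, marks) if u == "B" and k == "II")
m, n = a + c, b + d

# Number of distinct ordered results that collapse to the same Table 1.
equivalent = (factorial(m) * factorial(n)) // (
    factorial(a) * factorial(b) * factorial(c) * factorial(d)
)
print((a, b, c, d), equivalent)
```

For Table 2 this gives a = 2, b = 5, c = 6, d = 1, so the single summary table stands for 168 distinct sequences.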

If we now take rectangular axes in a plane, we can represent Table 1 by the point whose 
coordinates are (a, b). Thus ‘x’ in Fig. 1 represents the set of results equivalent, in the sense 
of the previous paragraph, to the results of Table 2. At the same time, all possible results 
of the experiment which gave rise to Table 2 are represented by the points of the rectangle 

* We hope that the qualifications we have attached to our statements will be sufficient to guard us 
against the accusation that we have adopted in full a ‘frequency theory’ of probability. The frequency 
interpretation is relevant to our particular problem; other problems may involve other interpretations. 
More than one interpretation may be relevant in a single problem. 
PQRS. We call this representation of possible results the lattice diagram.* Our problem 
may now be regarded as one of ordering the points of the lattice diagram according to the 
‘width’ of the difference they indicate. 

Conditions S and C 

In trying to make the idea of ‘width’ of difference precise, we are up against difficulties 
similar to those attaching to the interpretation of results on the basis of incomplete information 
about the circumstances of the experiment. The information given at first was 
compatible with several distinct ‘urn models’. Similarly, the information given now is 
compatible with several different notions about ‘width’ of difference. We may be concerned 
with the arithmetical size of the difference p_a − p_b, or with the ratio p_a/p_b, or with the 
logarithm of this ratio, or with some more complicated function. 

Logically, therefore, we should expect to set up various tests, based on various ideas of 
what constitutes ‘width’ of difference in probability or frequency (in Neyman and Pearson’s 
language, corresponding to various weight functions over the space of alternatives to the 
hypothesis tested). But here a factor which may simply be described as laziness enters in. 
If we carried our ideas to their logical conclusion, we should find ourselves constructing a 
new test for almost every new experiment we had to deal with; and the time and effort 
involved in this are too great. Consequently, we confine our attempt to producing a test 
which will be reasonably applicable to a wide class of cases of the type specified, without 
suggesting that this test is unique, or ‘best possible’. 

First, then, in our ordering of points in the lattice diagram, we propose that the same rank 
should be given to the point ((m − a), (n − b)) as to the point (a, b). This condition we propose 
to call the ‘symmetry condition’, or ‘condition S’. It amounts to saying, that if Table 1 
is to be considered as indicating a real difference between p_a and p_b, then so is Table 3, in 
which the labels ‘I’ and ‘II’ have been interchanged. If, when we are testing whether 

* Not the sample space of Neyman and Pearson. In the sample space, different results equivalent 
to Table 2 are represented by different points. 

Biometrika 34 9 
p_a = p_b, say, we are also testing whether 1 − p_a = 1 − p_b, from the same point of view, 
then this symmetry condition is clearly justified.* 

Next, we propose that in our ordering, the two points which, respectively, have the same 
abscissa or the same ordinate as (a, b), and which lie further from the diagonal PR, shall be 
considered as indicating wider differences than (a, b) itself. Thus, referring to Fig. 1, the 
points immediately above and immediately to the left of the point ‘x’ are reckoned to 
indicate wider differences than the point ‘x’ itself. This condition implies that the set of 
points indicating differences as wide or wider than (a, b) will have a shape property vaguely 
related to convexity, and we call it the ‘C condition’. It means that if we consider the 
table corresponding to Table 2, with cell frequencies 

    2   6 
    5   1 

as significant evidence of difference, then we must also consider the tables 

    1   7           2   6 
    5   1    and    6   0 

as significant evidence of difference. It is difficult to imagine circumstances where this 
would not be so. 

Geometrically, condition S implies that we can in future restrict our considerations to 
points in the lattice diagram lying on or above the diagonal PR, i.e. in the triangle PRS. 
And condition C implies that, in this triangle, our ‘width of difference’ must increase as 
we go upwards or to the left. If horizontal and vertical axes are taken at any point X in this 
triangle, points in the second quadrant are associated with a wider difference than X is, 
points in the fourth quadrant are associated with narrower differences than X is. The relative 
widths of differences associated with points in the first and third quadrants (excluding the 
axes) are not determined by the conditions C and S. The ordering generated by these 
conditions is thus a partial, not a total, ordering; it is, in fact, a kind of conical order, in the 
sense of A. A. Robb. We must introduce some further condition to make the ordering total. 
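
The partial ordering produced by the two conditions can be written down as a small comparison routine. This is only a sketch under stated assumptions: points are pairs (a, b) with 0 ≤ a ≤ m and 0 ≤ b ≤ n, a point below the diagonal is first mapped to its converse (m − a, n − b) by the symmetry condition, and the ‘second quadrant’ is taken to include its bounding axes. The function names are hypothetical.

```python
def reflect_to_upper(a, b, m, n):
    """Condition S: replace a point below the diagonal PR by its converse."""
    if a * n > b * m:  # a/m > b/n, i.e. below the diagonal
        return m - a, n - b
    return a, b

def compare(x, y, m, n):
    """Compare the width of difference indicated by x with that indicated by y.

    Returns 'same', 'wider', 'narrower', or 'undetermined' when conditions
    S and C alone do not settle the question.
    """
    ax, bx = reflect_to_upper(x[0], x[1], m, n)
    ay, by = reflect_to_upper(y[0], y[1], m, n)
    if (ax, bx) == (ay, by):
        return "same"
    if ax <= ay and bx >= by:  # x lies north-west of y
        return "wider"
    if ax >= ay and bx <= by:  # x lies south-east of y
        return "narrower"
    return "undetermined"

m, n = 8, 6
print(compare((1, 5), (2, 5), m, n))  # immediately to the left of (2, 5)
print(compare((2, 6), (2, 5), m, n))  # immediately above (2, 5)
print(compare((6, 1), (2, 5), m, n))  # the converse of (2, 5)
print(compare((1, 4), (2, 5), m, n))  # third quadrant relative to (2, 5)
```

The last call returns 'undetermined', which is the gap that a further condition has to close.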


Probability considerations 

In many simpler cases, it is possible to distinguish those events which are considered 
incompatible with a given probability hypothesis by their relatively low probability, 
compared with other possible events. Such a simple comparison of probabilities is not open 
to us in this case, because to each point (a, b) we have, on the hypothesis tested, associated 
a function 


\[ W(a, b; p) = \frac{m!\,n!}{a!\,b!\,c!\,d!}\; p^{a+b}(1-p)^{c+d} \]

which contains the ‘nuisance parameter’ p. If we consider the relative position, in our 
ordering, of another point, (a′, b′), we have to consider the inequality 

\[ W(a, b; p) < W(a', b'; p), \tag{6} \]

the truth or falsehood of which depends, in general, on the unknown p, and there is nothing 
in the statement of the problem, nor in the experimental method, to justify any particular 
choice for the value of p. 


* Cases where is impossible are hereby neglected, strictly. 
If (a + b) = (a′ + b′), the validity or otherwise of the inequality (6) is independent of p. 
Thus, using this inequality as a criterion for ordering our points, we can say that in the 
triangle PRS, the ‘width of difference’ must increase as we move north-west. But this is 
all that can be derived from this criterion, and it is clearly even less helpful in ordering the 
points than the conditions C and S are. Moreover, if we recall that each point (a, b) in the 
lattice diagram really represents a set of m! n!/(a! b! c! d!) distinct results, each with probability 
p^{a+b}(1 − p)^{c+d}, the criterion (6) loses its plausibility. 

We might try to improve the situation by associating the function W(a, b; p) with a 
number, depending on a and b only. (For fixed a and b, this number would be a functional 
of W(a, b; p).) We should clearly require that, if the inequality (6) is true for all p, then the 
corresponding inequality should be true of the numbers associated with W(a, b; p) and 
W(a′, b′; p). The simplest functionals which satisfy this condition will be the mean value, 

\[ w(a, b) = \int_0^1 W(a, b; p)\, dp, \]

the maximum value 

\[ w'(a, b) = \max_{0<p<1} W(a, b; p), \]

and one single value 

\[ w''(a, b) = W(a, b; p_0). \]

Circumstances could be imagined in which any of these three criteria might produce 
reasonable tests of significance. For example, in certain genetical experiments we may have 
reason to suppose that the value p = 1/3 would occur more often than any other value. In 
such a case we might use w″ with p_0 = 1/3. But for general purposes taking p_0 = 1/3 could 
not be justified. 
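
The three candidate functionals are easy to evaluate in closed form, which may help in comparing them. The sketch below assumes W(a, b; p) = [m! n!/(a! b! c! d!)] p^(a+b) (1 − p)^(c+d) with c = m − a and d = n − b; it uses the standard Beta integral for the mean value and the fact that p^r (1 − p)^s is greatest at p = r/(r + s). The function names are illustrative, not the paper's.

```python
from math import comb, factorial

def W(a, b, m, n, p):
    """Probability of the lattice point (a, b) when p_a = p_b = p."""
    r, s = a + b, (m - a) + (n - b)
    # comb(m, a) * comb(n, b) equals m! n! / (a! b! c! d!)
    return comb(m, a) * comb(n, b) * p**r * (1 - p)**s

def w_mean(a, b, m, n):
    """Mean value: integral of W over 0 <= p <= 1 (Beta integral)."""
    r, s = a + b, (m - a) + (n - b)
    return comb(m, a) * comb(n, b) * factorial(r) * factorial(s) / factorial(r + s + 1)

def w_max(a, b, m, n):
    """Maximum value of W over p, attained at p = r / (r + s)."""
    r, s = a + b, (m - a) + (n - b)
    return W(a, b, m, n, r / (r + s))

def w_single(a, b, m, n, p0):
    """Single value of W at a chosen p0."""
    return W(a, b, m, n, p0)

# The point (2, 5) of Table 2, with m = 8 and n = 6.
print(w_mean(2, 5, 8, 6), w_max(2, 5, 8, 6), w_single(2, 5, 8, 6, 1 / 3))
```

For the point (2, 5) the mean value is 168/51480 and the maximum is 168/2^14, attained at p = 1/2 because a + b = c + d = 7 there.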

We might again argue that taking w as our criterion would correspond with the assumption 
that all values of p were a priori equally likely. But some would say that such an assumption 
was never justified; while those who would admit the assumption would in strictness do so 
only if we really did know nothing about the value of p. And in the general circumstances we 
are trying to cater for, we may sometimes know something vague about the value of p— 
such as, for example, that p will be less than 

Neyman and Pearson have shown that the likelihood ratio, which in our case comes to be 

very often gives a good basis for ordering experimental results. We feel, however, that the 
criterion we shall describe in the next section has a slightly more direct justification than 
the likelihood ratio, though the choice is, admittedly, largely a matter of taste. 


The maximum condition 

Before setting out the final condition which, with conditions S and C, will be used eventually 
to arrange the points of the lattice diagram in order of ‘relative width of difference 
indicated’, we need to consider the assignment of significance levels to various results. 

When we say that a given result is not significant on, say, the 5 % level, we mean that 
such a result, or one indicating a wider difference, could occur, with probability at least 0.05, 
even when p_a = p_b. We could believe in a theory that p_a = p_b, without having to suppose 
that an event belonging to a class whose joint probability was less than 0.05 had occurred. 
Conversely, if a result is judged significant on the 5 % level, it means that no theory which 
assumed that p_a = p_b could account for the result obtained without supposing that an event 
of a type whose probability was less than 0.05 had occurred on the occasion in question. 

Let us now consider a specific case, in which we choose numbers which in practice would 
be ridiculously small in order to save arithmetic. Suppose, in fact, m = n = 2, while a = 2 
and b = 0. It follows from conditions S and C alone that in judging the significance of such 
a result we need consider only the probability of this result, together with its converse, in 
which a = 0, b = 2. The probability of results of this type is 

\[ P = 2p^{2}(1-p)^{2}. \]

Now suppose that we are prepared to discard as untenable theories which require us to suppose 
that events of probability less than 0.05 had occurred. In such a case, we should discard 
a theory which supposed p_a = p_b = 0.1, since in this case P = 2(0.1)^2 (0.9)^2 = 0.0162, less 
than 0.05. But we could not discard a theory which supposed p_a = p_b = 0.5, since in this 
case P = 0.125. In fact, our result would enable us to discard all theories involving p_a = p_b = p, 
except those for which p lay in the interval 0.197 < p < 0.803. In particular practical cases 
we might be prepared, on grounds external to the experiment in question, to dismiss the 
possibility that p should lie in this interval; and in such cases we should be entitled to say 
that the result excludes the possibility that p_a = p_b. 
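
The arithmetic of this small case can be reproduced directly. The sketch below assumes that the probability of the result a = 2, b = 0 together with its converse is 2p²(1 − p)² for m = n = 2, and solves 2p²(1 − p)² = 0.05 for the end-points of the interval of values of p that the result cannot exclude. The names P, alpha, lower and upper are illustrative.

```python
from math import sqrt

def P(p):
    """Probability of (a, b) = (2, 0) or (0, 2) when m = n = 2 and p_a = p_b = p."""
    return 2 * p**2 * (1 - p)**2

alpha = 0.05

# 2 p^2 (1 - p)^2 = alpha  is equivalent to  p (1 - p) = sqrt(alpha / 2),
# a quadratic in p whose two roots are symmetric about 1/2.
k = sqrt(alpha / 2)
lower = (1 - sqrt(1 - 4 * k)) / 2
upper = 1 - lower

print(P(0.1), P(0.5))
print(round(lower, 3), round(upper, 3))
```

The roots come out at about 0.197 and 0.803, so only theories with p between these values survive at the 5 % level.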
It is easy to see that the above specific case is typical. Any set of points in the lattice 
diagram, considered by some criterion agreeing with conditions S and C to indicate differences 
as wide or wider than those of a given result, will be associated with a probability P, 
on the assumption p_a = p_b = p; and this P will be a function of p, rising from zero when 
p = 0 to a maximum in the neighbourhood of p = 1/2 and then falling again symmetrically 
(by the S condition) to zero again at p = 1, somewhat as in Fig. 2. The given result by itself 
will exclude the possibility p_a = p_b altogether, only if the significance level adopted is greater 
than P_m, the maximum value of P. If our significance level corresponds to a probability 
less than P_m, then all we can say is, that our result is incompatible with p_a = p_b unless their 
common value lies in a certain subset of the range (0, 1). We may or may not exclude these 
latter possibilities on other grounds. 

In trying to construct our test, however, we have set ourselves the task of evaluating the 
evidence provided by our experiment alone in relation to the hypothesis p_a = p_b. It now appears 
that this is impossible so long as we restrict ourselves to the form, usual in such cases, of a 
simple statement that a given result is, or is not, significant on a given level. We have two 
alternatives. Either we can find an entirely new form of statement to convey what we wish 
to express; or we can adhere to the form of statement, and try to make the situation fit the 
form as nearly as possible. Perhaps the day will come when experimenters do not require 
answers in the form of numbers, when they are sufficiently versed in generalized mathematical 
analysis to be content with a function (such as the function P(p)), instead of a single 




number. But we have not yet reached this stage; and so we propose to take up the latter 
alternative, and try to make the situation fit the standard form of statement of significance 
tests as nearly as possible.* 

Our difficulty arises from the dependence of P on p. If the graph of P against p were a 
horizontal straight line, our difficulty would be overcome. What we propose, therefore, is 
to try to make the graph of P against p as near to a horizontal line as possible, by suitably 
adapting our idea of what is meant by ‘width of difference’. In making this adaptation, we 
shall secure that we do not violate the common-sense requirements as to the meaning of the 
term ‘width of difference’, by requiring that conditions C and S should always be satisfied. 

The maximum condition 

The condition C requires that, of all points in the triangle PRS, that indicating the 
‘widest difference’ must be the point S at the corner (Fig. 1). The function P associated with 
this point and its converse, Q, which we may denote as P(0, 6; p), is 

\[ P(0, 6; p) = p^{6}(1-p)^{8} + p^{8}(1-p)^{6} \]

and the maximum P_m occurs here when p = 1/2, where we have 

\[ P_m(0, 6) = 1/2^{13} = 1.22 \times 10^{-4}. \]

The condition C requires that the only points which might be considered as coming next 
after S, in order of decreasing ‘width of difference’, are (1, 6) and (0, 5). We have to adopt 
some principle to choose between these two. 

If (1, 6) were taken next after (0, 6), the function P associated with it would be 

\[ P'(1, 6; p) = P(0, 6; p) + 16\,p^{7}(1-p)^{7} \]

and P′_m(1, 6) would come to 9/2^{13} = 10.99 × 10^{-4}. On the other hand, if (0, 5) were chosen 
next, instead of (1, 6), we should have 

\[ P(0, 5; p) = P(0, 6; p) + 6\,[p^{5}(1-p)^{9} + p^{9}(1-p)^{5}] \]

and P_m(0, 5) would come to 8.55 × 10^{-4}, the maximum occurring when p = 1/2 ± … 

Thus P_m(0, 5) is smaller than P′_m(1, 6), and this lower maximum is associated with a flatter 
curve of P(0, 5; p). Since a flat curve is our aim (the horizontal line being the ideal), we 
choose (0, 5) as the point to come next after (0, 6), rather than (1, 6). 

Having chosen (0, 5) as the next ‘widest difference’ point, the C condition restricts us to 
the points (1, 6) and (0, 4) as candidates for the next position. We consequently compare 

\[ P(1, 6; p) = P(0, 5; p) + 16\,p^{7}(1-p)^{7} \]

with 

\[ P''(0, 4; p) = P(0, 5; p) + 15\,[p^{4}(1-p)^{10} + p^{10}(1-p)^{4}] \]

and the lower value of P_m as criterion shows that (1, 6) is now to be taken. At the next stage, 
we shall have to compare the functions associated with (0, 4), (1, 5) and (2, 6). In this way 
we can arrange the points of the lattice diagram in order, step by step. 
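
The step-by-step procedure just described can be sketched as a short program. This is an illustration under stated assumptions, not the author's own computing scheme: the maximum over p is approximated on an evenly spaced grid (so the levels agree with exact values only approximately), a candidate is admitted when the points immediately above it and immediately to its left have already been chosen, and ties are not grouped into a common rank. The names pair_term and order_points are hypothetical.

```python
from math import comb

def pair_term(a, b, m, n, p):
    """Probability of the point (a, b) plus that of its converse (m - a, n - b)."""
    r = a + b
    s = (m - a) + (n - b)
    coeff = comb(m, a) * comb(n, b)  # equals m! n! / (a! b! c! d!)
    return coeff * (p**r * (1 - p)**s + p**s * (1 - p)**r)

def order_points(m, n, steps, grid=2001):
    """Return the first `steps` points above the diagonal, with their levels P_m."""
    ps = [i / (grid - 1) for i in range(grid)]
    upper = {(a, b) for a in range(m + 1) for b in range(n + 1) if a * n < b * m}
    chosen, levels = [], []
    curve = [0.0] * grid  # accumulated P(p) on the grid
    for _ in range(steps):
        best = None
        for (a, b) in sorted(upper - set(chosen)):
            # C condition: the neighbours above and to the left come first.
            neighbours = [(a, b + 1), (a - 1, b)]
            if any(q in upper and q not in chosen for q in neighbours):
                continue
            trial = [c + pair_term(a, b, m, n, p) for c, p in zip(curve, ps)]
            level = max(trial)
            if best is None or level < best[0]:
                best = (level, (a, b), trial)
        if best is None:
            break
        level, point, curve = best
        chosen.append(point)
        levels.append(level)
    return chosen, levels

points, levels = order_points(8, 6, steps=3)
for point, level in zip(points, levels):
    print(point, f"{level:.3e}")
```

For m = 8, n = 6 the first three points come out as (0, 6), (0, 5) and (1, 6), in agreement with the order derived in the text.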

The principle involved, which we call the ‘maximum condition’, may be formally stated 
as follows; 

Considering only points for which a/m is less than b/n, if the first (n − 1) points 
(a_1, b_1), (a_2, b_2), ..., (a_{n−1}, b_{n−1}), in order of decreasing ‘width of difference’, have been chosen, and 

* In the example just taken we might make a kind of ‘conditional confidence interval statement’, 
that, if p existed, we should have 0.197 < p < 0.803 with confidence coefficient 0.95. 
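(a_{n−1}, b_{n−1}) is associated with the function P(a_{n−1}, b_{n−1}; p), then the nth point, (a_n, b_n), is 
that point, of all points (a, b) permitted by the C condition, for which 

\[ \max_{0<p<1}\Big\{ P(a_{n-1}, b_{n-1}; p) + \frac{m!\,n!}{a!\,b!\,c!\,d!}\,\big[p^{a+b}(1-p)^{c+d} + p^{c+d}(1-p)^{a+b}\big] \Big\} \]

is least. (a_n, b_n) is then associated with the function 

\[ P(a_n, b_n; p) = P(a_{n-1}, b_{n-1}; p) + \frac{m!\,n!}{a_n!\,b_n!\,c_n!\,d_n!}\,\big[p^{a_n+b_n}(1-p)^{c_n+d_n} + p^{c_n+d_n}(1-p)^{a_n+b_n}\big]. \]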

To complete the specification of the ordering, we have to legislate for the case where there 
are several points giving the same value of P_m(a, b), this value being less than that associated 
with any other permissible point. In this case we lay down that all such points are to be 
given the same rank, and the second term in the expression for P(a_n, b_n; p) is to be replaced 
by the corresponding sum over all these points. If there are k such points at any stage, then 
the next point after them will be denoted as the (n + k)th point in the ordering. This requires, 
for example, when m = n, that the points (a, b) and (b, a) are always to be taken together. 

Finally, the significance level to be attached to the point (a_n, b_n) will be 

\[ P_m(a_n, b_n) = \max_{0<p<1} P(a_n, b_n; p). \]

This guarantees that our test will be a ‘valid’ one, in the sense that, if we judge a result 
incompatible with the hypothesis p_a = p_b, on a given level of significance, then all the 
possibilities of the form p_a = p_b = p are excluded, to the given level. Thus no further information, 
external to the experiment in question, could make us decide that a result judged significant 
by our test was not in fact so (holding, of course, to a fixed significance level); on the other 
hand, we still have the possibility that other information may lead us to consider as significant 
results which appear in themselves not to be so. The formulation of our maximum 
condition is made so as to minimize this latter possibility. Our test is thus conservative, in 
the sense that we do not draw the conclusion p_a ≠ p_b unless this is certainly warranted by 
the data; but it might be called ‘progressive conservative’ because, of all such conservative 
tests, it will be the least conservative. 


Another aspect of the maximum condition 

When the author first approached the problem of analysis of experimental results of the 
type now considered, he did so from the point of view of regarding the significance level to 
be used as being fixed in advance, say at the 5 % level. From this point of view, the problem 
of constructing a test resolved itself, not into one of ordering the points in the lattice diagram, 
but into one of choosing a region, or set of points in the lattice diagram, such that any point 
belonging to this region could be regarded as evidence of inequality of p_a and p_b, on the given 
level of significance. The condition of symmetry required that such a region should consist 
of two similar parts, one above the diagonal PR, and one below it. The condition C required 
that the part of the region lying above the diagonal PR should be so shaped that if a point 
X belonged to the region, then so would all points lying north or west of X. There remained 
the problem, to decide which of the many regions satisfying these two conditions should be 
the one adopted. 

To settle this, to any such region R we can associate a function

$$P(R; p) = \sum_{(a,b) \in R} \frac{m!\,n!}{a!\,b!\,c!\,d!}\; p^{a+b}(1-p)^{c+d},$$

and such a region will give a 'valid' test of significance provided that

$$\max_{0<p<1} P(R; p) < 0.05.$$
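The validity condition can be checked numerically. The following is a minimal sketch, not the author's procedure: it assumes that c = m - a and d = n - b, approximates the maximum over p by scanning an evenly spaced grid, and uses function names (`region_probability`, `max_region_probability`, `is_valid`) that are purely illustrative.

```python
from math import comb


def region_probability(region, m, n, p):
    """P(R; p): chance that the result (a, b) falls in `region` when both
    samples (of sizes m and n) share the same proportion p."""
    return sum(
        comb(m, a) * comb(n, b) * p ** (a + b) * (1 - p) ** (m + n - a - b)
        for a, b in region
    )


def max_region_probability(region, m, n, grid=1000):
    """Approximate the maximum of P(R; p) over 0 < p < 1 on a grid of p values."""
    return max(region_probability(region, m, n, i / grid) for i in range(1, grid))


def is_valid(region, m, n, level=0.05):
    """A region gives a 'valid' test if its maximum probability is below the level."""
    return max_region_probability(region, m, n) < level


# Hypothetical region for m = n = 7: only the two most extreme corner points.
corners = {(0, 7), (7, 0)}
print(max_region_probability(corners, 7, 7))
print(is_valid(corners, 7, 7))
```

For the two corner points the function is 2 p^7 (1-p)^7, whose maximum is 2 x 0.5^14, roughly 0.00012, so this small region is valid at the 5% level with a wide margin. A finer grid, or a proper one-dimensional optimizer, could replace the scan if more accuracy were wanted.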

There will not be so many regions satisfying this validity condition as well as the conditions
S and C. We proposed, therefore, to select that region from among these, which had the
greatest number of points in it. This last condition was what we then called the 'maximum
condition'. The fact that this region would not be unique in cases where m = n was taken
care of by requiring a subsidiary symmetry condition that in such cases (a, b) and (b, a)
should always be taken together.

What we have now adopted as the 'maximum condition' can be seen to be related to this
earlier version, by the consideration that, roughly speaking, apart from effects due to the
discreteness of the lattice diagram, holding the number of points in the region constant, and
then choosing the region which gives the lowest value for $P_m$, as we do now, comes to the same
thing as holding $P_m$ constant, and then choosing the region to have the maximum number
of points.

Other things being equal, the 'power' of a test, in the sense of Neyman and Pearson, will
increase with the 'volume' of the rejection region chosen. In this sense we can say, roughly,
that the maximum condition secures that our test should be as powerful as possible,
consistent with validity.

Practical formulation of the test 

Some statistical tests (such as that due to Fisher, already mentioned) can be carried out
in the form of a direct calculation from the data, without reference to any special tables.
Most other tests require the use of special tables which, however, are for the most part tables
of single or double entry, perhaps triple entry, if the level of significance is regarded as a
variable. In our case, regarding the level of significance as a variable, a table of quadruple
entry would be required.

Ideally, a set of tables, one for each pair of values of m and n (m ≥ n), would be required.
The table would be in the shape of a right-angled triangle, corresponding to the triangle
PRS of Fig. 1, and divided into squares, each square corresponding to given values of a and b.
Within each square (a, b) would then appear a number, the value of $P_m(a, b)$. This value of
$P_m(a, b)$ then is the maximum probability of obtaining the result (a, b), or one indicating a
wider difference, if $p_A = p_B$. A comparison of $P_m(a, b)$ with the significance level adopted will
then decide the significance or otherwise of our result. In any particular case we shall be able
to see which tables, in the sense of our test, are regarded as indicating a wider difference, by
noting which points are associated with lower values of $P_m(a, b)$.

In practice, it will be impossible to construct such tables for a large range of values of m
and n. But for larger values of m and n, a test based on a normal approximation to the
distributions involved will be quite adequate for practical purposes. In fact, the test we have
proposed will itself approximate, in some sense, to a test based on the normal distribution,
though we do not enter into a detailed discussion of the relationship between the two tests
here.* Tables are thus required for our test only for small values of m and n. In spite of
advice by statisticians to the contrary, such small values of m and n continue to occur
frequently in practice.

* The general question of the sense in which tests are regarded as 'asymptotically approaching'
normal tests is a subject for another paper. Professor Pearson's paper, which follows, bears on this point.

In the Appendix we give specimen tables for the cases where N = 14. The comparative
figures for the Fisher test, also given in the Appendix, indicate that the differences between
the two tests are appreciable. An exploration is now under way into larger values of m and n,
and it is hoped to report on this in due course.


Other applications of the procedure 

We have spoken of our test as the C.S.M. test, as if the case dealt with above were the only
case to which the procedure adopted was applicable. But similar methods could be used
in many other cases. In particular, a method closely following the one we have used might
be applied to the case we have called the double dichotomy, which differs from the 2 x 2
comparative trial in that two 'nuisance parameters', p and p', are present, instead of only
one. The 2-dimensional lattice diagram of the 2 x 2 trial is replaced by a 3-dimensional
regular tetrahedron of points with homogeneous coordinates (a, b, c, d), connected by the
relation

$$a + b + c + d = N.$$

Two opposite edges of this tetrahedron correspond to m = 0 and n = 0, and sections of the
tetrahedron by planes parallel to these edges will look exactly like lattice diagrams for the
2 x 2 case, and within these sections relative probabilities will behave just as in the 2 x 2 case.
An examination of the possibilities, however, indicates that not much is to be gained by a
detailed treatment. The C.S.M. test for 2 x 2 comparative trials will be a valid test if applied
to double dichotomies. It will err somewhat on the side of 'conservatism', but the error
does not appear to be large, except when the numbers involved are exceedingly small.

It is with a view to further applications of the approach used in this paper that we have 
retained the C condition as a separate requirement, although it is easy to see that it could 
be absorbed into the M condition as we have given it. 


In writing this paper the author has had great personal help and encouragement from
Prof. E. S. Pearson, to whom he wishes to express his very deep thanks.


Summary 

In Part I we discuss various types of experiment, each of which may give rise to results 
in the form of a 2 x 2 table. It appears that significance tests which may be appropriate 
for one type of experiment will not necessarily be appropriate for another. 

In Part II a test is developed for experiments of the type called ‘2x2 comparative 
trials'. 


APPENDIX
Tables for the C.S.M. test

Three tables are given below to illustrate the application of the ideas given in the main paper
to the construction of a test for 2 x 2 comparative trials. The cases covered are pairs of
samples, sizes (7, 7), (8, 6), and (9, 5). The small figures in brackets in the (7, 7) table give
significance levels on Fisher's 'exact' test for 2 x 2 independence trials, for comparison.
Only half of the (8, 6) and (9, 5) tables are given; the missing parts can be filled in by
symmetry. The following examples show the meaning and use of the tables:

Example 1. Two boxes, each containing a large number of components, are to be tested
for comparative quality measured by the respective proportions of defective components
they contain. Two samples, each of seven components, are taken, at random, one from each
box. One sample gives four defectives, the other, none. What is the significance of this result,
in relation to the hypothesis that the boxes have the same quality?

Answer. Entering the (7, 7) table at the point (0, 4), we find the number 2.4. This means
that the result is evidence against the hypothesis, on the 2.4% level of significance.


Table for m = n = 7

  7    0.012    0.18     0.70     2.4      7.5      20       —        —
       (0.058)  (0.23)   (2.1)    (7.0)    (19)     (46)
  6    0.18     1.3      5.7      13       —        —        —        —
       (0.23)   (2.9)    (10)     (27)
  5    0.70     5.7      21       —        —        —        —        20
       (2.1)    (10)     (29)                                         (46)
  4    2.4      13       —        —        —        —        —        7.5
       (7.0)    (27)                                                  (19)
  3    7.5      —        —        —        —        —        13       2.4
       (19)                                                  (27)     (7.0)
  2    20       —        —        —        —        21       5.7      0.70
       (46)                                         (29)     (10)     (2.1)
  1    —        —        —        —        13       5.7      1.3      0.18
                                           (27)     (10)     (2.9)    (0.23)
  0    —        —        20       7.5      2.4      0.70     0.18     0.012
                         (46)     (19)     (7.0)    (2.1)    (0.23)   (0.058)

       0        1        2        3        4        5        6        7

More precisely, what is asserted is that the maximum probability of getting a result not
less significant than that obtained is 0.024. And the results which are not less significant
are those which correspond to points in the table with numbers not greater than 2.4, viz.
(0, 4), (7, 3), (0, 5), (7, 2), (0, 6), (7, 1), (0, 7), (7, 0), (1, 6), (6, 1), (1, 7), (6, 0), (2, 7), (5, 0),
(3, 7), (4, 0). By suitable choice of the proportion defective, we could construct a pair of
boxes, of equal quality, which would give samples falling in this group 24 times out of 1000,
on the average; but we could not, by any choice of proportion defective, retain equal quality
and yet have results in this group more often than 24 times in 1000.
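The figure of 24 in 1000 can be reproduced approximately by direct computation. The sketch below is illustrative only. It assumes the group consists of the sixteen points with table entries not greater than 2.4, taking the partner of (2, 7) under the symmetry of the table to be (5, 0), and it approximates the maximum over the common proportion defective by a grid scan. The names `GROUP` and `group_probability` are hypothetical.

```python
from math import comb

M = N_SAMPLE = 7  # both samples have seven components

# The sixteen results counted as "not less significant" than (0, 4).
GROUP = [
    (0, 4), (7, 3), (0, 5), (7, 2), (0, 6), (7, 1), (0, 7), (7, 0),
    (1, 6), (6, 1), (1, 7), (6, 0), (2, 7), (5, 0), (3, 7), (4, 0),
]


def group_probability(p):
    """Chance of a result in GROUP when both boxes have proportion defective p."""
    return sum(
        comb(M, a) * comb(N_SAMPLE, b) * p ** (a + b) * (1 - p) ** (M + N_SAMPLE - a - b)
        for a, b in GROUP
    )


# Scan p over a fine grid in (0, 1) and keep the largest chance found.
STEPS = 10000
worst_probability = max(group_probability(i / STEPS) for i in range(1, STEPS))
print(f"maximum chance of the group: {100 * worst_probability:.1f}%")
```

At p = 0.5 the chance of the group is 352/16384, about 2.1%; the maximum is reached at a proportion defective away from one half, which is why the tabulated level is somewhat larger than the value at p = 0.5.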






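Table for m = 8, n = 6

       0        1        2        3        4        5        6        7        8

  6    0.012    0.18     0.71     2.5      5.3      13       —        —        —        —
  5    0.085    1.3      6.6      11       —        —        —        —        —        —        5
  4    0.44     3.9      19       —        —        —        —        —        —        —        4
  3    1.9      16       —        —        —        —        —        —        —        6.3      3
  2    8.0      —        —        —        —        —        —        20       3.8      0.86     2
  1    23       —        —        —        —        —        14       7.4      1.3      0.13     1
  0             —        —        16       10       6.3      2.3      0.62     0.19     0.012    0

       0        1        2        3        4        5        6        7        8        9

Table for m = 9, n = 5

(The upper-left entries belong to the (8, 6) table and are read against the top and left-hand
scales; the lower-right entries belong to the (9, 5) table and are read against the bottom and
right-hand scales.)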


Example 2. The situation is as before, except that the first sample has nine components,
none of them defective, while the second sample has five components, four of them defective.

(i) Introductory

1. The problem of testing the significance of a difference between two proportions is one
which receives early attention in text-books on mathematical statistics, and it might be
thought to be one of the questions whose final solution lies behind us. It is a problem whose
simplicity makes it easy to examine the logical cogency of the methods put forward for its
solution, but, on examination, it is evident that they have not yet been rounded off
satisfactorily. The origin of the present paper lies partly in an investigation commenced in 1938
and discussed at the time in College lectures, and partly in recent correspondence in Nature
in which G. A. Barnard (1945a, b) and R. A. Fisher (1945a) have taken part.* This
correspondence has suggested that in a problem of such apparent simplicity, starting from
different premises, it is possible to reach what may sometimes be very different numerical
probability figures by which to judge significance.

2. Such a difference in levels of significance in the solution of an everyday problem is
obviously puzzling to the users of statistical methods who are accustomed to accept the
technique as an established procedure and have not the opportunity for a critical
examination of the conditions under which probability theory is brought to bear as a guide to action.
For the question here at issue is a fundamental one of why and how our judgement is
influenced by the calculation of a probability, and the dilemma raised by the Barnard-Fisher
correspondence can only be answered in terms of our views on the practical function of the
theory. We may all agree that in practice we use probability figures derived from an analysis
of numerical data to help us to make up our minds on the next step, whether in
experimental research or executive action. But what form of presentation of the probability set-up
is likely to result in the greater number of sound decisions is likely to be always a matter for
differences of opinion.

3. All that I can do is to approach the problem of the 2 x 2 table from the viewpoint
which appears most helpful to me. In the preceding paper Mr Barnard has elaborated the
views expressed in his letters to Nature. Such discussion is, I believe, desirable, even though
controversial issues are raised. For the value of the whole elaborate structure of the
modern theory of mathematical statistics depends at least in part on the sense in which the
individual statistician appreciates the meaning of the probability model he is using when
drawing the practical conclusions from his analysis of data. I have used the words 'in part',
for it is true that the analytical process of applying the statistical technique to
experimental data may in itself be enormously illuminating even without paying any close regard
to a final probability figure. Such is the case, for example, with the technique of analysis of
variance, where the mere process of breaking up a total sum of squares into parts with which
different sources of variability can be associated, brings with it a reward in clear thinking
even without the application of a probability test.

* There was also an earlier discussion on the same subject between E. B. Wilson (1941, 1942) and
R. A. Fisher (1941).

4. There is a very wide variety in the types of situation in which probability theory is
introduced to help in reaching a decision as to further action.

(A) At one extreme we have the case where repeated decisions must be made on results
obtained from some routine procedure carried out under controlled conditions.

(B) At the other is the situation where statistical tools are applied to an isolated
investigation of considerable importance in which many of the issues involved in the conclusion can
hardly be assessed in numerical terms.

5. Two situations of this kind, in which the statistical technique involved is that of testing
the significance of a difference between two proportions, may be illustrated from problems
arising in the 'proof' of armour-piercing shot or shell.

6. Example of type A. In the proof of small anti-tank, armour-piercing shot it might be
decided to set aside, as a standard, a batch of shot whose quality has been established by
special trials; against this standard, later batches can be compared. The variable measured
is the proportion of shot which fail to perforate a plate of specified thickness when fired with
a given striking velocity. The use of standard shot is necessary for calibration purposes,
because there are inevitable changes in toughness from one proof plate to another and only
a limited number of shot can be fired at a single plate. Then the situation might be summed
up as follows:*

Aim of proof. To ensure that as few batches as possible are passed into service which 
are less effective than the standard. 

Method of proof. Twelve rounds of the standard and twelve of the batch under test to be
fired, round for round, against a single test plate and a record kept of the number of failures
in each group, say a and b.

Routine sentencing rule. This should lay down a ready means of determining, from a
knowledge of a and b, whether to class the new batch as inferior to the standard or not.

Assumptions accepted in using rule. That the two samples of twelve shot have each been
randomly selected from the much larger batches. That against the particular plate used, a
proportion $p_1$ of the standard and $p_2$ of the new batch would fail to give satisfactory
perforation at the specified striking velocity. That while $p_1$ and $p_2$ would be different for other
plates, if $p_2 > p_1$ for one plate, it will be so for all other plates. The objective is to segregate
batches of shot for which $p_2 > p_1$.

* It has been somewhat simplified for illustrative purposes; e.g. complete control of the striking
velocity is not in practice possible.

7. Example of type B. Two types of heavy armour-piercing naval shell of the same
calibre are under consideration; they may be of different design or made by different firms.
Since the cost of producing and testing a single round of this kind runs into many hundreds
of pounds, the investigation is a costly one, yet the issues involved are far reaching. Twelve
shells of one kind and eight of the other have been fired; two of the former and five of the
latter failed to perforate the plate. In what way can a statistical test contribute to the
decision which must be taken on further action?

8. In dealing with Example A the guiding principle followed in seeking help from the
theory of probability can be very simple. We can set as our object a rule which:

(i) will result in an increasing chance of detecting that $p_2 > p_1$, the larger the difference;

(ii) will leave only a small chance of segregating the new batch wrongly when, in fact, $p_2 \le p_1$.

Diagrammatically the rule would consist in segregating the new batch when the point (a, b)
falls within some such area as that shown shaded in Fig. 1. In this problem involving a routine
procedure, it is the long-run frequency of different consequences of the proof sentencing which
is of importance, and probability theory is introduced to provide a measure of expected
frequency. This method of introducing the theory of probability into this proof problem is not
necessarily the only one that could be adopted in fixing a routine procedure, but it is a simple
one and, since simplicity has the merit of appealing to the user's understanding, it has
great advantages.

9. When dealing with Example B a very considerable number of factors must be weighed in
the balance, and the result of a statistical test of significance could never be the over-riding one.
There will be other information as to the effect of changes in shell design, possibly from
shell of different calibre; information as to the uniformity in quality of output of the
firm or firms concerned; questions of cost and of general policy. He would be a bold man who
would attempt to express these in numerical terms. Whereas when tackling problem A
it is easy to convince the practical man of the value of a probability construct related to
frequency of occurrence, in problem B the argument that 'if we were to repeatedly do so
and so, such and such result would follow in the long run' is at once met by the
common-sense answer that we never should carry out a precisely similar trial again.

10. Nevertheless, it is clear that the scientist with a knowledge of statistical method
behind him can make his contribution to a round-table discussion, provided he has
acquired a grasp of the practical issues. Starting from the basis that individual shell will
never be identical in armour-piercing qualities, however good the control of production,
he has to consider how much of the difference between (i) two failures out of twelve and
(ii) five failures out of eight is likely to be due to this inevitable variability. There may be a
number of ways of sizing up the position involving different assumptions or hypothetical
constructs; he may follow one or several of these. The value of his advice is dependent almost
entirely on the soundness of his scientific judgement, and very little on whether his
back-room calculations have been based on inverse or direct probability or on an appeal to
fiducial argument.

11. How far, then, can one go in giving precision to a philosophy of statistical inference?
It seems clear that in certain problems probability theory is of value because of its close
relation to frequency of occurrence; such seems to be the case for my Example A. Tests can
be built up to satisfy the practical requirements in this field. In other and, no doubt, more
numerous cases there is no repetition of the same type of trial or experiment, but all the
same we can and many of us do use the same test rules to guide our decision, following the
analysis of an isolated set of numerical data. Why do we do this? What are the springs of
decision? Is it because the formulation of the case in terms of hypothetical repetition helps
to that clarity of view needed for sound judgement? Or is it because we are content that the
application of a rule, now in this investigation, now in that, should result in a long-run
frequency of errors in judgement which we control at a low figure? On this I should not care
to dogmatize, realizing how difficult it is to analyse the reasons governing even one's own
personal decisions.

12. That the frequency concept is not generally accepted in the interpretation of
statistical tests is of course well known. With his characteristic forcefulness R. A. Fisher (1945b)
has recently written: 'In recent times one often repeated exposition of the tests of
significance, by J. Neyman, a writer not closely associated with the development of these tests,
seems liable to lead mathematical readers astray, through laying down axiomatically, what
is not agreed or generally true, that the level of significance must be equal to the frequency
with which the hypothesis is rejected in repeated sampling of any fixed population allowed
by hypothesis. This intrusive axiom, which is foreign to the reasoning on which the tests of
significance were in fact based seems to be a real bar to progress...'

13. But the subject of criticism seems to me less an intrusive mathematical axiom than 
a mathematical formulation of a practical requirement which statisticians of many schools 
of thought have deliberately advanced. Prof. Fisher’s contributions to the development of 
tests of significance have been outstanding, but such tests, if under another name, were 
discovered before his day and are being derived far and wide to meet new needs. To claim 
what seems to amount to patent rights over their interpretation can hardly be his serious 
intention. Many of us, as statisticians, fall into the all too easy habit of making authoritative 
statements as to how probability theory should be used as a guide to judgement, but 
ultimately it is likely that the method of application which finds greatest favour will be that 
which through its simplicity and directness appeals most to the common scientific user’s 
understanding. Hitherto the user has been accustomed to accept the function of probability
theory laid down by the mathematicians; but it would be good if he could take a larger share 
in formulating himself what are the practical requirements that the theory should satisfy 
in application. 


(ii) The choice of statistical tests

14. One approach to follow in determining tests to be applied to the 2 x 2 class of problem
follows the lines that Neyman and I have adopted since 1928 in dealing with tests of
statistical hypotheses. Let me first recapitulate in broad terms the steps in that approach when
applied to a problem where the universe of possible observations can be represented by a
finite set of discrete points. A test of significance may be described as a method of analysis
of statistical data which helps us to discriminate between alternative theories or hypotheses.
In order to make use of the theory of probability in the sense here understood, a random
process must either have been purposely introduced or be assumed to have been present in
the collection of data; then the hypothesis very often concerns the values of parameters
contained in the probability laws which, in the conceptual sphere, form the mathematical
counterpart of the sampling distributions of experience.

15. We proceed by setting up a specific hypothesis to test, $H_0$ in Neyman's and my
terminology, the null hypothesis in R. A. Fisher's. At the same time, in choosing the test, we
take into account alternatives to $H_0$ which we believe possible or at any rate consider it
most important to be on the look out for. Thus we wish the test to have maximum
discriminating power within a certain class of hypotheses. Three steps in constructing the test
may be defined:

Step 1. We must first specify the set of results which could follow on repeated application
of the random process used in the collection of the data; this may be termed the
experimental probability set.

Step 2. We then divide this set by a system of ordered boundaries or contours such that
as we pass across one boundary and proceed to the next, we come to a class of results which
makes us more and more inclined, on the information available, to reject the hypothesis
tested in favour of alternatives which differ from it by increasing amounts.

Step 3. We then, if possible, associate with each contour level the chance that, if $H_0$ is
true, a result will occur in random sampling lying beyond that level.

This rather crude statement of procedure will be developed in more detail in discussing 
the problems that arise in connexion with the 2x2 table. 

16. Notes on these points. (a) Step 1. This involves the definition of what Neyman and
I have termed the sample space, W. The application in three forms of the 2 x 2 problem is
discussed in paragraphs 19, 27 and 46 below.

(b) Step 2. For a given hypothesis under test there may be a number of ways of deriving
a system of contours, and only in certain cases can there be said to be complete agreement
on which is the 'best'. Practical expediency will often carry weight in the choice. It is widely
accepted that the choice cannot be made without paying regard to the admissible hypotheses
alternative to $H_0$, whether this process is given formal precision or taken as a broad guide.
In our first papers (Neyman & Pearson, 1928a, b) we suggested that the likelihood ratio
criterion, $\lambda$, was a very useful one to employ in determining a family of contours which
would be ordered in relation to our confidence in the hypothesis tested when set against
the background of admissible alternatives. Thus Step 2 preceded Step 3. In later papers
(Neyman & Pearson, 1933, 1936 and 1938) we started with a fixed value for the chance, $\epsilon$,
of Step 3 and determined the associated contour, taking account of what we termed the
power of a test with regard to the alternative hypotheses. The family of Step 2 followed
on giving decreasing values to $\epsilon$. However, although the mathematical procedure may
put Step 3 before 2, we cannot put this into operation before we have decided, under
Step 2, on the guiding principle to be used in choosing the contour system. That is why
I have numbered the steps in this order.

(c) Step 3. If this can be accomplished, we have what Neyman and I called control of the
'1st kind of error'. In problems where, as below, we are concerned with discrete rather than
continuous probability distributions (e.g. for the binomial, the Poisson, the multinomial
and the hypergeometric distributions), this objective cannot always be achieved, and it
may be necessary to be satisfied with a knowledge of an upper limit of the chance of rejecting
the hypothesis tested when it is true.

(iii) Application of this approach to the analysis of data classed in a 2 x 2 table

17. The frequencies of the data in the table may be defined in the following notation:

Table 1

             Col. 1   Col. 2   Total
  Row 1        a        c        m
  Row 2        b        d        n
  Total        r        s        N


If we follow in turn the steps defined above to determine the method of interpretation of
such data, the requirements of the appropriate tests are seen to follow very simply, although
mathematical or computational difficulties arise in implementing them. On taking Step 1
we can separate out at once the three types of problem which Barnard has differentiated;*
these I shall call Problems I, II and III. They are distinguished by the sample space having
1, 2 and 3 dimensions respectively. From the mathematical point of view it might seem more
logical to take them in the reverse order, adding first one and then a second restriction to
the 3-dimensioned case of Problem III. For a simple exposition, I think the reverse procedure
of building up from I to III is preferable and this has been adopted in the following sections.

(iv) Problem I 

18. This may be described as the test of the significance of the difference between two
treatments after these have been randomly assigned to a group of N individuals
(Barnard terms it the 2 x 2 independence trial). To use the terminology of a particular
application, we may say that we are observing the presence or absence of 'reaction X'.
The first treatment is applied to m and the second to n of the N individuals; as a result a/m
and b/n show reaction X.

19. In this case the random process has been applied within the group of N individuals,
and its repetition would simply involve other random reassignments of the two treatments
among the N. No assumption is made as to how the N individuals were selected from some
larger universe. The repetition may be hypothetical, in the sense that it often could not
take place, e.g. if reaction X = death. Indeed, repetition under the same essential conditions
is frequently impossible in practice. But this correspondence between the frequency of
results upon hypothetical repetition and the probability distribution of the counterpart
mathematical model forms an accepted part of the process of reasoning whereby (following
the present approach) we use probability theory as a basis for inference. The hypothesis
tested is that while some individuals show reaction X and some do not, the result would be
the same whichever treatment were applied as far as these N individuals are concerned. Thus,
on the null hypothesis, there are r = a + b individuals who will react and s = c + d who will
not, whatever the assignment of treatments.

* Statisticians had, of course, all been more or less conscious of these differences, but, at any rate
in my own case, it was discussion with Mr Barnard which made it easy to see the problem in its full
clarity.


20. The chance that a will react in m and b = r - a in n is, therefore, if the hypothesis
be true,

$$P_1\{a \mid N, r, m\} = \frac{m!\,n!\,r!\,s!}{N!\,a!\,b!\,c!\,d!}. \tag{1}$$

This expression is proportional to the coefficient of $x^a$ in the hypergeometric series

$$F(\alpha, \beta, \gamma, x) = F(-r, -m, n - r + 1, x). \tag{2}$$

Thus, taking $m \ge n$, a can assume values of

(i) $0, 1, \ldots, r$ if $r \le n$,

(ii) $r - n, \ldots, r$ if $n < r \le m$,

(iii) $r - n, \ldots, m$ if $r > m$.

For this probability distribution, it is known (K. Pearson (1899) and Kendall (1943, p. 127))
that

$$\text{Mean } a = \frac{rm}{N}, \tag{3}$$

$$\text{Variance of } a = \sigma_a^2 = \frac{mnrs}{N^2(N-1)}. \tag{4}$$
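The mean and variance of this distribution can be checked by direct summation. The sketch below is illustrative and makes its own assumptions: it writes the chance of a in the equivalent binomial-coefficient form C(m, a) C(n, r - a) / C(N, r), uses exact rational arithmetic, and takes the values N = 20, r = 7, m = 12 purely as a worked case. The function names are hypothetical.

```python
from fractions import Fraction
from math import comb


def chance_of_a(a, N, r, m):
    """Chance that a of the r reacting individuals fall among the m given treatment 1."""
    n = N - m
    return Fraction(comb(m, a) * comb(n, r - a), comb(N, r))


def possible_values(N, r, m):
    """Values a can take: from max(0, r - n) up to min(r, m)."""
    n = N - m
    return range(max(0, r - n), min(r, m) + 1)


def mean_and_variance(N, r, m):
    """Mean and variance of a, obtained by summing over the whole distribution."""
    probs = {a: chance_of_a(a, N, r, m) for a in possible_values(N, r, m)}
    mean = sum(a * p for a, p in probs.items())
    variance = sum((a - mean) ** 2 * p for a, p in probs.items())
    return mean, variance


N, r, m = 20, 7, 12
n, s = N - m, N - r
mean, variance = mean_and_variance(N, r, m)

# Compare with the closed forms rm/N and mnrs / (N^2 (N - 1)).
assert mean == Fraction(r * m, N)
assert variance == Fraction(m * n * r * s, N * N * (N - 1))
print(float(mean), float(variance) ** 0.5)
```

Exact fractions are used so that the agreement with the closed forms is an equality rather than a floating-point coincidence.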

21. For the particular case

$$N = 20, \quad r = 7, \quad m = 12, \quad n = 8,$$

the terms in the distribution of $P_1\{a \mid 20, 7, 12\}$ are shown as ordinates in Fig. 2 and given in
the accompanying Table 2. The experimental probability set consists of the eight
alternative values for a, viz. 0, 1, ..., 7, with which the probabilities tabled are associated if $H_0$ is
true. Further,

$$\text{Mean } a = \bar{a} = 4.2, \quad \sigma_a = 1.0721. \tag{5}$$

22. Next consider Step 2. The purpose of the investigation is to test the hypothesis
that the difference between a/12 and (r - a)/8 has resulted simply from a random
partition of 20 individuals, of whom r will show reaction X in whichever treatment
group they are included. The experiment gives r = 7. The contour levels fall between
the 8 points of the set as shown in Fig. 2; the further a lies towards the right, the more
inclined we shall be to accept the alternative hypothesis that a/12 > (r - a)/8 because
treatment 1 is more effective than treatment 2. The further a lies to the left, the more
we shall incline towards the reverse alternative. To complete Step 3, we have only to
calculate the sums of the tail terms of the hypergeometric series, as shown in Table 2 for
the special case.


Table 2. Problem I. Chances for special case N = 20, r = 7, m = 12, if $H_0$ is true

                          Chance of a or less
  a    Chance of a     True value    Normal approx.
  0      0.0001          0.000          0.000
  1      0.0043          0.004          0.006
  2      0.0477          0.052          0.056
  3      0.1987          0.251          0.257
  4      0.3576            -              -

                          Chance of a or more
  a    Chance of a     True value    Normal approx.
  5      0.2861          0.392          0.390
  6      0.0954          0.106          0.113
  7      0.0102          0.010          0.016


23. Having set up the machinery of the test, we come to the practical question. Beyond
which contour levels must a fall before we infer that there is a treatment difference? Not,
I think, in the example, if a were 3, 4 or 5; possibly if a = 6, more probably if a = 2 and almost
certainly if a = 0, 1 or 7. Were we to fix as critical levels those between a = 1 and 2 on the
one hand, and between a = 6 and 7 on the other, then we should be guided in our decision
by the following knowledge: if there were no treatment difference, so that seven out of the
twenty individuals would have shown reaction X whichever treatment were applied, then
the chance under random assignment of treatments that a < 2 or > 6 is only 0.014 or 1 in 70.
Had we taken the critical levels between 2 and 3 and between 6 and 7, the corresponding
chance would be 0.062 or 1 in 16. This summing up in terms of probability helps towards
the balanced decision on the next practical step to be taken, because it helps us to assess the
extent of purely chance fluctuations that are possible. It may be assumed that in a matter
of importance we should never be content with a single experiment applied to twenty
individuals; but the result of applying the statistical test with its answer in terms of the chance
of a mistaken conclusion if a certain rule of inference were followed, will help to determine
the lines of further experimental work and the degree of confidence with which we proceed
provisionally to adopt a new technique.
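The two chances quoted in this paragraph are sums of tail terms of the hypergeometric series and may be checked as follows. This is an illustrative Python sketch only; the names `hypergeom_pmf` and `rejection_chance` are hypothetical and are introduced here for convenience.

```python
from math import comb

def hypergeom_pmf(a, N, r, m):
    """P1{a | N, r, m} of equation (1), written with binomial coefficients."""
    n = N - m
    if a < max(0, r - n) or a > min(r, m):
        return 0.0
    return comb(m, a) * comb(n, r - a) / comb(N, r)

def rejection_chance(N, r, m, lower_max, upper_min):
    """Chance, if the null hypothesis is true, that a <= lower_max or a >= upper_min."""
    n = N - m
    return sum(hypergeom_pmf(a, N, r, m)
               for a in range(max(0, r - n), min(r, m) + 1)
               if a <= lower_max or a >= upper_min)

# Critical levels between a = 1 and 2, and between a = 6 and 7.
strict = rejection_chance(20, 7, 12, lower_max=1, upper_min=7)
# Critical levels between a = 2 and 3, and between a = 6 and 7.
looser = rejection_chance(20, 7, 12, lower_max=2, upper_min=7)

print(f"a < 2 or a > 6: {strict:.4f}")
print(f"a < 3 or a > 6: {looser:.4f}")
```

The unrounded sums lie close to the figures of 0.014 and 0.062 given in the text, which appear to have been formed from the three-decimal entries of Table 2.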

24. An experiment falling under this head has the advantage that the random process 
introduced is under complete control. The analysis will give an answer in probability terms 
whether the N individuals have been randomly selected from a larger whole or not. But this 
answer is limited in the sense that it relates only to the N; if we wish to draw conclusions
about a wider population or populations, then a random selection of the N or, separately, 
of both its parts m and n is needed. Thus we come to Problems II and III. 

25. Approximation to the hypergeometric terms. When dealing with small numbers, the
calculation of the tail terms of the series may not be laborious, but it soon becomes so when
r is large. An obvious approximation is that obtained by using an integral under the normal
curve with the mean and standard deviation of equations (3) and (4) to represent the sum
of the hypergeometric terms. As usual when approximating to the sum of the terms for
x = a, a + 1, a + 2, etc., of a discrete probability distribution by the integral under a
continuous curve, we take this integral from the point $x = a - \tfrac{1}{2}$. Thus Fig. 3 shows the
normal curve

$$p(x) = \frac{1}{\sqrt{2\pi}\,\sigma_a} \exp\left[-\tfrac{1}{2}(x - \bar{a})^2/\sigma_a^2\right], \tag{6}$$

with $\bar{a}$ and $\sigma_a$ as in equations (5), and the approximation to the sum of the hypergeometric
terms for a = 6 and 7 is

$$\int_{5.5}^{\infty} p(x)\,dx,$$

represented by the area marked with cross-hatching. The approximations for different
levels are shown in Table 2, and are seen in this case to be quite adequate for the purpose of
the test. Further comparisons are made in the Appendix, and it appears that provided m
and n are fairly nearly equal, as they are likely to be in most planned experiments of the
Problem I type, the normal approximation is surprisingly good. Yates (1934) has suggested
a method of further correction.
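The normal approximation with the half-unit adjustment can be sketched in a few lines. The following Python fragment is illustrative only; the helper names are hypothetical, and the normal tail area is obtained from the complementary error function of the standard library rather than from tables.

```python
from math import erfc, sqrt

def normal_upper_tail(z):
    """Area of the standardized normal curve beyond the deviate z."""
    return 0.5 * erfc(z / sqrt(2.0))

def approx_a_or_more(a, mean, sd):
    """Normal approximation to the chance of a or more: integrate from a - 1/2."""
    return normal_upper_tail((a - 0.5 - mean) / sd)

def approx_a_or_less(a, mean, sd):
    """Normal approximation to the chance of a or less: integrate up to a + 1/2."""
    return normal_upper_tail((mean - (a + 0.5)) / sd)

N, r, m = 20, 7, 12
n, s = N - m, N - r
mean = r * m / N                                  # equation (3)
sd = sqrt(m * n * r * s / (N**2 * (N - 1)))       # square root of equation (4)

for a in (0, 1, 2, 3):
    print(f"a or less, a = {a}: {approx_a_or_less(a, mean, sd):.3f}")
for a in (5, 6, 7):
    print(f"a or more, a = {a}: {approx_a_or_more(a, mean, sd):.3f}")
```

The values so obtained may be set beside the column headed 'Normal approx.' in Table 2.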

26. The correction for continuity. In the 2 × 2 table connexion, the improvement obtained
by taking the normal integral (i) from $x = a - \tfrac{1}{2}$ if $a > \bar{a}$ or (ii) from $x = a + \tfrac{1}{2}$ if $a < \bar{a}$ (so that
we are summing for the lower tail), was pointed out by Yates (1934) and has often been
termed ‘Yates’s correction for continuity’. It is, however, the natural adjustment to make
on the basis of the Euler-Maclaurin theorem, when approximating to a sum of ordinates
by an integral, and without wishing to detract from the value of Yates’s suggestion in this
particular problem, it should be pointed out that the adjustment was used by statisticians
well before 1934, when employing a normal or skew curve to give the sum of terms of a
binomial or hypergeometric series.*


(v) Problem II

27. This may be described as the test of whether the proportion of individuals bearing a 
character A is the same in two different populations, from each of which a random sample 
has been drawn, i.e. the test of the hypothesis that 

$$p_1(A) = p_2(A) = p, \tag{7}$$

* The method was in use in the Department of Applied Statistics when I joined the staff in 1921, 
and may have been current many years before that. 
where p is some common but unspecified proportion. Barnard describes this as the case of
the 2 × 2 comparative trial. Here m individuals have been drawn at random from the first
population and n from the second, and it is found that a/m and b/n, respectively, bear the
character A. The conditions are assumed to be such that if the random procedure of selection
were repeated, the appropriate probability distributions for a and b would be given by the
terms of binomial expansions. Table 3 shows the observed results.


Table 3

                  No. with         No.
                  character A      without A      Total
  1st sample          a                c             m
  2nd sample          b                d             n
  Total               r                s             N


Fig. 3. The curves ABC and A'B'C' represent the significance contours $L_\epsilon$ and $L'_\epsilon$ respectively.


In this Problem there have been two applications of a random selection process, not one
as for Problem I, and the experimental probability set consists of the (m + 1)(n + 1) alternative
values of the doublet (a, b) ($0 \leq a \leq m$, $0 \leq b \leq n$) which can be represented in the lattice
diagram shown in Fig. 3 for the special case m = 12, n = 8. It might, of course, be argued
that in repetition of the selection process m and n need not remain constant,
but this, I think, would introduce an unnecessary complication into the probability set-up.
28. The question before us is whether the result (a, b) is consistent with the hypothesis
$H_0$ defined in equation (7) above, or whether it suggests that either $p_1 > p_2$ or that $p_1 < p_2$.
A little reflexion shows that we have no reason to reject $H_0$ if the point (a, b) lies near the
diagonal line on which a/m = b/n, but, broadly speaking, are more and more likely to do so
the farther the point falls from this line in the direction of the corners (0, n) and (m, 0) of the
lattice diagram. This statement requires amplification. In defining the significance contours
we may consider the following question: If $H_0$ is not true, what departures from equality
in $p_1$ and $p_2$ do we regard it of equal importance to detect? Should the power of the test be
roughly the same for constant values, for example, of

(a) ..., (b) ... or (c) ...?

The procedure which I have adopted in the sections which follow is frankly one of
expediency. I have not considered in detail how to choose a family of significance contours
satisfying requirements formulated in advance, but have taken those suggested by the
customary large-sample procedure which gives contours of the form ABC, A'B'C' drawn
in Fig. 3. These will, I believe, make the power of the test to detect a difference more nearly
dependent on the ratio of the odds given by (c) than on either of the expressions (a) or (b).
E. B. Wilson (1941) chooses the expression (a). This point, however, needs further
investigation. It should be noted that a similar problem, in the case where the sampling
distributions follow the Poisson law, was discussed very fully by Przyborowski & Wilenski (1939).


29. Besides involving a 2-dimensional instead of a 1-dimensional experimental
probability set, Problem II differs from Problem I in that we need an answer which is
independent of the unknown common probability p of the null hypothesis. In Problem I the part
of p was played by the fraction r/N given by the data. We are concerned now with what
Neyman and I (Neyman & Pearson, 1933) have termed a composite hypothesis, and were
it possible would like the contour levels to bound regions which are ‘similar to the sample
space with regard to the parameter p’ (loc. cit. p. 313) (i.e. are independent of p). The
following considerations show the lines along which a first attack of the problem can proceed.

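30. If $H_0$ is true and equation (7) holds, then the probability of the observed result may
be written*

$$P_2\{a \mid p, m\} \times P_2\{b \mid p, n\} = \frac{m!}{a!\,c!}\,p^a(1-p)^c \times \frac{n!}{b!\,d!}\,p^b(1-p)^d \tag{8.1}$$

$$= \frac{N!}{r!\,s!}\,p^r(1-p)^s \times \frac{m!\,n!\,r!\,s!}{a!\,b!\,c!\,d!\,N!} \tag{8.2}$$

$$= P_2\{r \mid p, N\} \times P_1\{a \mid N, r, m\}. \tag{8.3}$$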

Thus the probability of obtaining the doublet (a, b) in sampling from two populations with
a common p may be regarded as the product of two terms:

(i) The probability that a + b = r or that the point (a, b) in Fig. 3 falls on a diagonal line
on which r = constant. This probability, $P_2\{r \mid p, N\}$, is the (r + 1)th term in the expansion
of the binomial

$$\{(1 - p) + p\}^N.$$

(ii) The relative probability, given r, of the observed partition into a and b = r − a; this
is independent of p and is identical with the expression $P_1\{a \mid N, r, m\}$ of equation (1), i.e. is
proportional to a term of the hypergeometric series (2).

* It will be seen that $P_1\{\ \}$ has been used to denote a hypergeometric probability and $P_2\{\ \}$ a
binomial probability.
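The factorization of the joint binomial probability into a binomial term in r and a hypergeometric term free of p can be verified numerically over the whole lattice. The Python sketch below is illustrative; the function names are hypothetical and the values of p are chosen arbitrarily.

```python
from math import comb, isclose

def binom_pmf(k, size, p):
    """P2{k | p, size}: a term of the binomial expansion."""
    return comb(size, k) * p**k * (1.0 - p)**(size - k)

def hypergeom_pmf(a, N, r, m):
    """P1{a | N, r, m}: the relative probability of the partition, given r."""
    n = N - m
    if a < max(0, r - n) or a > min(r, m):
        return 0.0
    return comb(m, a) * comb(n, r - a) / comb(N, r)

def joint_probability(a, b, m, n, p):
    """Product of the two binomial terms for the doublet (a, b)."""
    return binom_pmf(a, m, p) * binom_pmf(b, n, p)

def factored_probability(a, b, m, n, p):
    """Binomial term in r = a + b times the hypergeometric term, as in (8.3)."""
    N, r = m + n, a + b
    return binom_pmf(r, N, p) * hypergeom_pmf(a, N, r, m)

m, n = 12, 8
agree = all(
    isclose(joint_probability(a, b, m, n, p), factored_probability(a, b, m, n, p),
            rel_tol=1e-9)
    for p in (0.2, 0.5, 0.85)
    for a in range(m + 1)
    for b in range(n + 1)
)
print("factorization holds at every lattice point:", agree)
```

Only the first factor changes with p; the second is the same whatever common proportion is assumed, which is the property used in the paragraphs that follow.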
31. If, now, it were possible to draw a boundary line such as ABC shown in Fig. 3,
cutting off at the end of each diagonal, r = constant, a group of points (a, r − a) such that

$$\sum_a \left[P_1\{a \mid N, r, m\}\right] = \epsilon, \tag{9}$$

where $\epsilon$ is a fraction between 0 and 1 chosen at will, then the requirement of Step 3 would
be satisfied. For in rejecting $H_0$ when (a, b) falls beyond this boundary,* the chance of doing
so if $H_0$ were true would be

$$\sum_{r=0}^{N} \left[P_2\{r \mid p, N\} \times \epsilon\right] = \epsilon \times \sum_{r=0}^{N} \left[P_2\{r \mid p, N\}\right] = \epsilon, \tag{10}$$

i.e. would be independent of the unknown common p of the hypothesis tested. The test
would then be analogous to ‘Student's’ test for the significance of the difference between
two means, where we have a system of contour levels each associated with a chance $\epsilon$,
independent of the values of any unknown parameters which are irrelevant to the
composite hypothesis tested.

32. Unfortunately, this objective cannot be achieved because we are not dealing with
continuous probability distributions and $P_1\{a \mid N, r, m\}$ exists only at discrete, integral
values of a. If we follow the present line of approach, all that is possible is to take contour
or significance levels which cut off from an end of each diagonal, r = constant, a group of
points for which

$$\sum_a \left[P_1\{a \mid N, r, m\}\right] = \beta_r \leq \epsilon. \tag{11}$$

Then, in rejecting $H_0$ when (a, b) falls beyond such a contour, we know that the chance of
doing so, if $H_0$ is true, will be

$$\sum_{r=0}^{N} \left[P_2\{r \mid p, N\} \times \beta_r\right] \leq \epsilon. \tag{12}$$

It is clear that the amount by which the probability falls below $\epsilon$ will be a function of p,
and that in taking Step 3 we are only associating with each significance level an upper
limit, $\epsilon$, to the probability of rejecting $H_0$ when it is true.

33. We have still, of course, to determine the most appropriate system of significance
levels and to set out a ready means of finding an upper limit, $\epsilon$, associated with the level on
which an observed doublet (a, b) falls.† Mr Barnard has broken new ground in

(i) defining for this Problem II one systematic method of determining a family of levels
$L_\epsilon$ based on certain clearly defined principles;

(ii) determining the true upper bound to the associated probability $\epsilon$ which, in the case
of small samples at any rate, may be considerably below that which has hitherto been used.

Since, however, much tabling is needed before his theoretical advance can be followed
by a practical working rule available for samples of any sizes, m and n, I think it is worth
while describing the cruder handling of the lattice diagram which I had discussed in 1938-9

* There would be a similar series of boundaries, below the diagonal a/m = b/n, such as A'B'C'
of Fig. 3.

† The likelihood ratio $\lambda$ might be used in determining the family of significance contours, as was
suggested in connexion with the general problem (Neyman & Pearson, 1928b, p. 283). In large
samples $\lambda$ would approximately equal $e^{-\frac{1}{2}u^2}$, where u is given by equation (22) below.
lectures. This involves, perhaps, not much more than a restatement of what may be termed
the classical approach to Problem II (see paras. 43 and 44 below), but it does bring out the
difference between Problems I and II, which I think important.

34. It may be well to emphasize here that this distinction between the handling of
Problems I and II is not universally accepted. Fisher has set out his approach as follows in
a paper read before the Royal Statistical Society (1935): ‘To the many methods of treatment
hitherto suggested for the 2 × 2 table the concept of ancillary information suggests this new
one. Let us blot out the contents of the table, leaving only the marginal frequencies. If it
be admitted that these marginal frequencies by themselves supply no information on the
point at issue, namely, as to the proportionality of the frequencies in the body of the table,
we may recognize the information they supply as wholly ancillary; and therefore recognize
that we are concerned only with the relative probabilities of occurrence of the different ways
in which the table can be filled in, subject to these marginal frequencies.’

This view has also been supported by Yates (1934). As I understand it, Fisher would refer
the observation (a, b) to a linear set (as in my Problem I), however the data have been
collected; this attitude follows readily if we discard the requirement that the probability
distribution used in the test must be related to the frequency distribution that would be
generated by repeated application of the random sampling process employed in the
experiment. It will be seen that with Fisher’s approach there is a gain in simplicity in handling
the analysis; it must remain a matter of opinion whether there is a loss in the relevance
of the probability construct to the question at issue. It is, of course, only when handling
small samples or in cases where (a, b) lies close to one of the corners (0, 0) or (m, n) of the
lattice that this need for choice between probability constructs is thrust upon us.


(vi) Solution of Problem II, using the normal approximation 

35. If the samples are large, the calculation of hypergeometric terms becomes laborious
and we turn naturally, as in so many other statistical problems, to the approximation using
the normal curve. In fact, except when r or s are very small or m and n very different in
magnitude, the normal curve with mean and standard deviation given by equations (3)
and (4) provides a surprisingly good approximation to the relative probability distribution
of a for fixed r, viz. $P_1\{a \mid N, r, m\}$ (see Appendix). Define $u_\epsilon$ as the deviate of the standardized
normal curve for which

$$\frac{1}{\sqrt{2\pi}} \int_{u_\epsilon}^{\infty} e^{-\frac{1}{2}u^2}\,du = \epsilon. \tag{13}$$

Then we can draw across the lattice diagram a significance level $L_\epsilon$ above and another $L'_\epsilon$
below* the diagonal a/m = b/n such that

(i) all points (a, b) for which

$$a \leq \bar{a} - \tfrac{1}{2} - u_\epsilon \sigma_a \tag{14}$$

lie beyond, i.e. above, $L_\epsilon$,

(ii) and all points (a, b) for which

$$a \geq \bar{a} + \tfrac{1}{2} + u_\epsilon \sigma_a \tag{15}$$

lie beyond, i.e. below, $L'_\epsilon$.

* The words ‘above’ and ‘below’ are used in the sense of Figs. 3 and 4.
If we wish to take special action either when a/m is significantly less than b/n or significantly
greater, then we shall use both levels $L_\epsilon$ and $L'_\epsilon$; if only, however, when a/m < b/n, then we use
$L_\epsilon$. The corresponding probability levels would be obtained by making $\epsilon$ for the second case
twice its value for the first. Fig. 4 shows the 247 relative probabilities $P_1\{a \mid N, r, m\}$ for the
case m = 18, n = 12. The unbroken, stepped lines are two contour levels determined in this
way. Purely for convenience in drawing, the level with $\epsilon = 0.05$ and $u_{0.05} = 1.6449$ has been
put above the diagonal and that with $\epsilon = 0.01$ and $u_{0.01} = 2.3263$ below.

36. If the normal approximation to the hypergeometric series were correct, it would
follow that along every diagonal, r = constant, the sum of the relative probabilities for
points above $L_\epsilon$ would satisfy the inequality (11). Hence the inequality (12) for the complete
area of the lattice above $L_\epsilon$ would hold, whatever the value of the common p. A similar result
would hold for the area below $L'_\epsilon$. Of course, the normal approximation will not hold
precisely, particularly when r or s are small, but here we shall generally be on the safe side, in
the sense that the hypergeometric distribution is flat-topped with abrupt ends so that the
$\beta_r$ of equation (11) will be considerably less than $\epsilon$, and often zero.

37. It is interesting to examine the results set out in Fig. 4 with the help of the detailed
calculations given in Table 4. Columns (2) and (3) give, for constant r, the mean and standard
deviation of $P_1\{a \mid 30, r, 18\}$, while columns (4) (for $L_{0.05}$) and (8) (for $L'_{0.01}$) give the cut-off
points defined by the normal approximation, i.e.

$$a_1 = \bar{a} - \tfrac{1}{2} - u_{0.05}\,\sigma_a \quad \text{and} \quad a_2 = \bar{a} + \tfrac{1}{2} + u_{0.01}\,\sigma_a. \tag{16}$$

The sums of the relative probabilities $P_1\{a \mid 30, r, 18\}$ for $a \leq a_1$ and $a \geq a_2$ are given in
cols. (5) and (9) respectively. Thus, for example, for r = 7

$$a_1 = 4.2 - 0.5 - 1.6449 \times 1.1543 = 1.80,$$

and the sum of the probabilities for a = 0 and 1 is

$$0.0004 + 0.0082 = 0.0086.$$

These are the tail sums, termed $\beta_r$ in equation (11). It is clear from an examination of
cols. (5) and (9) that they are all less, and many of them very much less than 0.05 and 0.01.
This is inevitable with a discrete distribution containing few terms. The contour levels have
been drawn conventionally in Fig. 4 as steps passing through the half-integer points and
not through the cut-off points of cols. (4) and (8). Clearly, whichever way they are drawn,
they will separate off the same subset of the (m + 1)(n + 1) points in the lattice diagram.

38. The next question is this. If we were to use either of these levels, what in fact would
be the chance of the sample doublet (a, b) falling beyond, if the null hypothesis were true?
This will depend on the common value of p. The product sums

$$\sum_{r=0}^{N} \left[P_2\{r \mid p, N\} \times \beta_r\right] = \sum_{r=0}^{N} \left[\frac{N!}{r!\,s!}\,p^r(1-p)^s \times \beta_r\right], \tag{17}$$

obtained by multiplying the expressions in cols. (5) and (9) of Table 4 by the appropriate
binomial terms are shown for a variety of values of p in Table 5, cols. (2) and (3). It is clear
at once how far on the safe side we are in saying that these chances are $\leq 0.05$ and 0.01
respectively. Similar calculations were carried out for a second example, taking m = n = 10,
Fig. 4. Hypergeometric probabilities in lattice diagram for m = 18, n = 12.

Normal curve approximations to significance levels. Above diagonal, $L_{0.05}$; below diagonal, $L'_{0.01}$: unbroken lines, with ½ adjustment; dashed lines, without ½ adjustment.






Table 4 . Significance levels for case m 
and the results are shown in Table 5, cols. (6) and (7). In this case, the actual chances of
(a, b) falling on or beyond the significance levels are even further below the nominal limits
of 0.05 and 0.01. In fact, it becomes clear that in the case of small samples, at any rate, this
method of introducing the normal approximation gives such an overestimate of the true
chances of falling beyond a contour as to be almost valueless.
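The product sums of expression (17) are easily formed by machine. The Python sketch below is an editorial illustration of Method 1 for the level above the diagonal in the case m = 18, n = 12; the function names are hypothetical, the deviate 1.6449 is that quoted in para. 35, and the values of p are chosen arbitrarily and are not necessarily those of Table 5.

```python
from math import comb, sqrt

def binom_pmf(k, size, p):
    """P2{k | p, size}: a term of the binomial expansion."""
    return comb(size, k) * p**k * (1.0 - p)**(size - k)

def hypergeom_pmf(a, N, r, m):
    """P1{a | N, r, m} of equation (1)."""
    n = N - m
    if a < max(0, r - n) or a > min(r, m):
        return 0.0
    return comb(m, a) * comb(n, r - a) / comb(N, r)

def beta_r(N, r, m, u_eps):
    """Tail sum of (11): relative probability of a <= mean - 1/2 - u_eps * sd."""
    n, s = N - m, N - r
    mean = r * m / N
    sd = sqrt(m * n * r * s / (N**2 * (N - 1)))
    cutoff = mean - 0.5 - u_eps * sd
    return sum(hypergeom_pmf(a, N, r, m)
               for a in range(max(0, r - n), min(r, m) + 1)
               if a <= cutoff)

def true_chance(m, n, p, u_eps):
    """Expression (17): chance of falling beyond the level when the common proportion is p."""
    N = m + n
    return sum(binom_pmf(r, N, p) * beta_r(N, r, m, u_eps) for r in range(N + 1))

U_05 = 1.6449  # deviate cutting off a single tail of 0.05
print(f"beta_r for r = 7: {beta_r(30, 7, 18, U_05):.4f}")
for p in (0.1, 0.3, 0.5, 0.7, 0.9):
    print(f"p = {p:.1f}: chance of falling beyond L_0.05 = {true_chance(18, 12, p, U_05):.4f}")
```

For r = 7 the tail sum reproduces the figure of 0.0086 worked out in para. 37, and the summed chance remains below the nominal 0.05 for each p tried.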

Table 5. Showing the difference between nominal and actual significance levels

               1st example: m = 18, n = 12            2nd example: m = 10 = n
  p            Method 1           Method 2            Method 1           Method 2           p
  (if H_0      True chance of     True chance of      True chance of     True chance of     (if H_0
  true)        falling on or      falling on or       falling on or      falling on or      true)
               beyond             beyond              beyond             beyond
               L_0.05   L'_0.01   L_0.05   L'_0.01    L_0.05   L'_0.01   L_0.05   L'_0.01
  (1)          (2)      (3)       (4)      (5)        (6)      (7)       (8)      (9)       (10)


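[Body of Table 5: the column alignment of the entries is lost in this copy. The entries that
survive are, in reading order: 0.0010; 0.0069; 0.0064; 0.0261; 0.0141; 0.0483; 0.0456; 0.2;
0.0174; 0.0490; 0.0014; 0.0496; 0.0058; 0.0062; 0.0546; 0.0062; 0.4; 0.0219; 0.0119; 0.0120;
0.7; 0.0113; 0.0009; 0.0052; 0.9; 0.96; 0.0010. The columns for the second example carry the
note ‘Repeat as for 1 − p’.]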







39. Before considering a second method, it will be useful to recapitulate certain
characteristics of what I have termed Method 1. It provides for any nominal value of $\epsilon$ one systematic
procedure of defining a critical boundary, or significance level cutting off a region from the
lattice diagram. Neither the subgroup of points cut off, nor the sum of the probabilities
associated with them for a given p, will alter continuously with $\epsilon$; they will change by discrete
steps as the cut-off point, defined in para. 37, passes through a point (a, b). While we shall
sometimes want to know whether the observed (a, b) falls beyond a level specified in
advance, more often we shall ask what is the level on which (a, b) falls. This, using Method 1,
we find by calculating

$$u = \frac{a + \tfrac{1}{2} - \bar{a}}{\sigma_a} \ \text{ if } a < \bar{a} \quad \text{or} \quad u = \frac{a - \tfrac{1}{2} - \bar{a}}{\sigma_a} \ \text{ if } a > \bar{a}, \tag{18}$$

and finding $\epsilon$ from the normal integral of equation (13). In this way the nominal chance $\epsilon$
will be a little nearer the true upper limit than the figures in Table 5 suggest,* but not enough
to modify the criticism expressed above.
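The calculation of the level on which an observed doublet falls may be sketched as follows. The Python fragment is illustrative only: the function name is hypothetical, the treatment of the two tails is one reading of equation (18) (the lower tail area when a lies below its mean, the upper when above), and the observed doublet (1, 6) is chosen arbitrarily from the diagonal r = 7 of the case m = 18, n = 12.

```python
from math import erfc, sqrt

def nominal_level_method1(a, b, m, n):
    """Return (u, eps): the corrected deviate of (18) and the nominal single-tail chance."""
    N, r = m + n, a + b
    s = N - r
    mean = r * m / N
    sd = sqrt(m * n * r * s / (N**2 * (N - 1)))
    if sd == 0.0 or a == mean:
        raise ValueError("equation (18) is stated only for a < mean or a > mean, with r and s not zero")
    if a < mean:
        u = (a + 0.5 - mean) / sd
        eps = 0.5 * erfc(-u / sqrt(2.0))   # lower tail area, up to u
    else:
        u = (a - 0.5 - mean) / sd
        eps = 0.5 * erfc(u / sqrt(2.0))    # upper tail area, beyond u
    return u, eps

u, eps = nominal_level_method1(1, 6, 18, 12)
print(f"u = {u:.3f}, nominal chance = {eps:.4f}")
```

For this doublet the nominal chance may be compared with the tail sum 0.0086 obtained for r = 7 in para. 37.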


* It will be seen from Table 4 that no point (a, b) gives $\beta_r$ in cols. (5) and (9) of exactly 0.05 or 0.01,
respectively, so that no points actually lie on $L_{0.05}$ or $L'_{0.01}$.
40. Method 2. The introduction of the correction of ½ for continuity is certainly
appropriate in using the normal approximation to the hypergeometric series in Problem I,
but I think it is not helpful in Problem II where we are concerned with a 2-dimensional
experimental probability set. If instead of obtaining significance levels $L_\epsilon$ and $L'_\epsilon$ as in
paras. 35-37, we obtain them from inequalities similar to (14) and (15) but with the
correction of ½ omitted, then there are several points to be noted:

(a) For the significance level $L_\epsilon$, the expression

$$\beta_r = \sum_a \left[P_1\{a \mid N, r, m\}\right], \tag{19}$$

where the summation is for values of a on the diagonal, r = constant, for which

$$a \leq \bar{a} - u_\epsilon \times \sigma_a, \tag{20}$$

will be sometimes less and sometimes greater than $\epsilon$. Hence, in the balance, it seems likely
that the chance of the point (a, b) lying beyond $L_\epsilon$ or

$$\sum_{r=0}^{N} \left[P_2\{r \mid p, N\} \times \beta_r\right] \tag{21}$$

will lie closer to $\epsilon$ than when the ½ correction is used. The position will be the same for $L'_\epsilon$.

(b) In drawing repeated samples of m and n from two populations in which there is a
common chance, p, of an individual possessing character A, the ratio

$$u = \frac{a - \bar{a}}{\sigma_a} = \frac{a - rm/N}{\sqrt{mnrs/\{N^2(N-1)\}}} \tag{22}$$

has, whatever be p, (i) an expectation of zero, (ii) a unit standard deviation.* The shape of
the distribution will, of course, depend on p, but, faute de mieux, we may not in the long run
do too badly by assuming it to be normal. It is, of course, the weighted combination of a
number of hypergeometric series whose shape depends on r.

41. Consider the result of applying this Method 2 to the case m = 18, n = 12 already
discussed. The procedure for determining the 0.05 and 0.01 significance levels will be exactly
as under Method 1, except that the continuity correction of ½ is omitted. The resulting levels
are shown as dashed, stepped lines in Fig. 4.† They fall, on the whole, inside the significance
levels obtained by Method 1. Now turn to Table 4, where cols. (6) and (10) show the cut-off
points a half unit further in towards the diagonal a/m = b/n. Cols. (7) and (11) give the values
of $\beta_r$; some of these are considerably above the nominal values of $\epsilon$ = 0.05 and 0.01, others
are still well below. But from the approach to Problem II that has been adopted, this is
immaterial since the experimental probability set is the 2-dimensioned one of the lattice
diagram and is not restricted to the diagonal r = constant on which the observed point
(a, b) may happen to lie. What we are concerned with is the summed chance given by
expression (21) and the value of this is given for eleven values of p in cols. (4) and (5) of Table 5. It
will be seen that this true chance does sometimes exceed the nominal values of 0.05 and 0.01,


* Provided cases where r or s are zero, making the expression (22) indeterminate with u = 0/0,
are excluded. Mr Barnard has pointed out that one way of avoiding this exclusion would be to lay
down that, when u = 0/0, we assign to the ratio a value chosen at random from a population (say
normal) with zero mean and unit variance.

† Again, for convenience the 5% level is drawn above, and the 1% level below the diagonal.
but never by very much. Again, for the second example with m = n = 10 (Table 5, cols.
(8) and (9)) the true chance, while it sometimes exceeds the nominal value, is always
considerably nearer it than using the significance levels of Method 1.
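The comparison of the two methods may be illustrated by forming the tail sums and the summed chance with and without the correction of ½. The Python sketch below is an editorial illustration; the function names and the flag `half_correction` are hypothetical, and diagonals on which r or s is zero are given a zero tail sum, following the exclusion stated in the footnote to equation (22).

```python
from math import comb, sqrt

def binom_pmf(k, size, p):
    """P2{k | p, size}: a term of the binomial expansion."""
    return comb(size, k) * p**k * (1.0 - p)**(size - k)

def hypergeom_pmf(a, N, r, m):
    """P1{a | N, r, m} of equation (1)."""
    n = N - m
    if a < max(0, r - n) or a > min(r, m):
        return 0.0
    return comb(m, a) * comb(n, r - a) / comb(N, r)

def tail_sum(N, r, m, u_eps, half_correction):
    """beta_r for the level above the diagonal; Method 1 if half_correction, else Method 2."""
    n, s = N - m, N - r
    if r == 0 or s == 0:
        return 0.0  # u = 0/0 on these diagonals; excluded, as in the footnote to (22)
    mean = r * m / N
    sd = sqrt(m * n * r * s / (N**2 * (N - 1)))
    cutoff = mean - (0.5 if half_correction else 0.0) - u_eps * sd
    return sum(hypergeom_pmf(a, N, r, m)
               for a in range(max(0, r - n), min(r, m) + 1)
               if a <= cutoff)

def true_chance(m, n, p, u_eps, half_correction):
    """Expression (21): the summed chance of falling beyond the level."""
    N = m + n
    return sum(binom_pmf(r, N, p) * tail_sum(N, r, m, u_eps, half_correction)
               for r in range(N + 1))

U_05 = 1.6449
for p in (0.2, 0.4, 0.6, 0.8):
    c1 = true_chance(18, 12, p, U_05, half_correction=True)
    c2 = true_chance(18, 12, p, U_05, half_correction=False)
    print(f"p = {p:.1f}: Method 1 = {c1:.4f}, Method 2 = {c2:.4f}")
```

Since the region cut off without the correction always contains that cut off with it, the chance under Method 2 can never be the smaller of the two; on the diagonal r = 7, for instance, the tail sum under Method 2 takes in the point a = 2 and rises above 0.05.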

42. It is clear that no final conclusions can be based on two numerical examples, but it
seems that the test of the null hypothesis in Problem II should be carried out as follows:

(a) When m, n, r or s are small, with the help of tables prepared on Barnard's lines, based
on an ordered classification of the points in the lattice diagram, and giving the true upper
bound of the chance that a point (a, b) falls on or beyond the level on which the observed
result lies. The particular basis of his classification may, of course, be modified.

(b) When m, n, r and s are large, by assuming that the u of equation (22) is a normal
deviate with unit standard deviation.


(vii) The classical approach to Problem II 

43. It has recently become customary to regard the test of significance applied to data
given in a 2 × 2 table as the limiting case of a $\chi^2$ test with one degree of freedom. But Problem
II was originally answered in somewhat different terms. It was noted that if

$$p_1(A) = p_2(A) = p \tag{23}$$

then the fractions a/m and b/n would both have expectations of p and variances of p(1 − p)/m
and p(1 − p)/n, respectively. Hence, if the null hypothesis were true, the difference

$$d = \frac{a}{m} - \frac{b}{n} \tag{24}$$

would have

$$\text{mean } d = 0, \tag{25}$$

$$\sigma_d = \sqrt{p(1-p)\left(\frac{1}{m} + \frac{1}{n}\right)}. \tag{26}$$

In large samples, therefore, it might be expected that

$$\frac{d}{\sigma_d} = \frac{a/m - b/n}{\sigma_d}$$

would be approximately normally distributed. Since by the nature of the problem the
common value of p was unknown, an estimate was made from the sample, namely,

$$\frac{a + b}{m + n} = \frac{r}{N}. \tag{27}$$

Substituting this into equation (26), we have*

$$\frac{d}{s_d} = \frac{a/m - b/n}{\sqrt{(r/N)(1 - r/N)(1/m + 1/n)}} \tag{28.1}$$

$$= \frac{a - rm/N}{\sqrt{mnrs/N^3}}. \tag{28.2}$$

44. The form (28.2) is easily derived from (28.1), if we remember that b = r − a, s = N − r
and m + n = N. It is seen that the ratio $d/s_d$ is identical with the ratio u of equation (22),
except for a factor $\sqrt{(N-1)/N}$ which is unimportant in large samples. Thus the classical
test is practically identical with that suggested in paras. 40-42 above, though the two tests
are differently derived.
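The identity between the classical ratio and the u of equation (22) can be checked numerically for any table. The Python sketch below is illustrative; the function names are hypothetical and the table a = 3, b = 6, m = 12, n = 8 is chosen arbitrarily.

```python
from math import sqrt, isclose

def classical_ratio(a, b, m, n):
    """d / s_d of (28.1), with the common p estimated by r / N."""
    N, r = m + n, a + b
    p_hat = r / N
    d = a / m - b / n
    s_d = sqrt(p_hat * (1.0 - p_hat) * (1.0 / m + 1.0 / n))
    return d / s_d

def u_ratio(a, b, m, n):
    """The ratio u of equation (22), using the hypergeometric standard deviation."""
    N, r = m + n, a + b
    s = N - r
    return (a - r * m / N) / sqrt(m * n * r * s / (N**2 * (N - 1)))

def third_form(a, b, m, n):
    """(ad - bc) * sqrt(N) / sqrt(mnrs), the alternative form given in the footnote."""
    c, d = m - a, n - b
    N, r = m + n, a + b
    s = N - r
    return (a * d - b * c) * sqrt(N) / sqrt(m * n * r * s)

a, b, m, n = 3, 6, 12, 8
N = m + n
print(f"d / s_d    = {classical_ratio(a, b, m, n):.4f}")
print(f"u          = {u_ratio(a, b, m, n):.4f}")
print(f"third form = {third_form(a, b, m, n):.4f}")
```

Multiplying the classical ratio by $\sqrt{(N-1)/N}$ recovers u exactly, so that the two differ by about 2.5 per cent when N = 20 and by progressively less as N increases.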


* A third alternative form is, of course, $(ad - bc)\sqrt{N}/\sqrt{mnrs}$.
(viii) Problem III

45. This may be described as the test for the independence of two characters A and B.
It is supposed that the probability that an individual selected at random will possess
character A is p(A) and that he will not possess it is $p(\bar{A}) = 1 - p(A)$. The corresponding
probabilities for character B are p(B) and $p(\bar{B}) = 1 - p(B)$. Four alternative combinations
of the characters may occur, which may be denoted by $AB$, $\bar{A}B$, $A\bar{B}$ and $\bar{A}\bar{B}$. The various
probabilities are set out in Table 6A. If the null hypothesis, specifying the independence
of A and B is true, then

$$p(AB) = p(A) \times p(B), \quad p(\bar{A}B) = p(\bar{A})\,p(B), \quad \text{etc.} \tag{29}$$

To test the hypothesis, we have a random sample of N observations with frequencies of
occurrence of the combinations $AB$, $\bar{A}B$, etc., which may be classified in the 2 × 2 scheme of
Table 6B. The sampling conditions are such that the probabilities of Table 6A are the same
for all individuals selected, or, in conventional terms, the sample is drawn from an infinite
population. Barnard calls this problem that of the double dichotomy.


Table 6A. Probabilities

              A            Ā            Total
  B           p(AB)        p(ĀB)        p(B)
  B̄           p(AB̄)        p(ĀB̄)        p(B̄)
  Total       p(A)         p(Ā)         1

Table 6B. Sample data

              A      Ā      Total
  B           a      c      m
  B̄           b      d      n
  Total       r      s      N


46. In Problem III there is only one application of a random process, the selection of N
individuals, each one of which must fall into one or other of four alternative categories. If
the random process were repeated and another sample of N drawn, not only are the
frequencies a, b, c and d free to vary, but also both marginal totals, i.e. m may change as well
as r. The experimental probability set will therefore contain results (a, b, c, d) restricted by
the conditions (i) that none of the frequencies can be negative and (ii) that

$$a + b + c + d = N. \tag{30}$$

Geometrically, as Barnard points out, the set can be represented in 3 dimensions by points
at unit intervals within a tetrahedron obtained by placing on top of one another the series
of 2-dimensioned lattices of dimensions

$$0 \times n, \quad 1 \times (n-1), \quad 2 \times (n-2), \quad \ldots, \quad (m-1) \times 1, \quad m \times 0. \tag{31}$$
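The structure of this probability set can be made concrete by enumerating it for a small N. The Python sketch below is an editorial illustration with hypothetical function names; it lists every table (a, b, c, d) satisfying condition (30) and counts the points lying in each section on which m = a + c is constant.

```python
from math import comb

def probability_set(N):
    """All tables (a, b, c, d) of non-negative frequencies with a + b + c + d = N."""
    return [(a, b, c, N - a - b - c)
            for a in range(N + 1)
            for b in range(N + 1 - a)
            for c in range(N + 1 - a - b)]

def points_by_section(N):
    """Number of points in each section of the tetrahedron on which m = a + c is constant."""
    counts = {}
    for a, b, c, d in probability_set(N):
        m = a + c
        counts[m] = counts.get(m, 0) + 1
    return counts

N = 6
points = probability_set(N)
sections = points_by_section(N)
print(f"N = {N}: {len(points)} points in all")
for m in sorted(sections):
    print(f"  m = {m}: {sections[m]} points")
```

Each section with m fixed is a lattice of (m + 1)(N − m + 1) points, a taking the values 0 to m and b the values 0 to N − m, and the whole set contains (N + 1)(N + 2)(N + 3)/6 points.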

47. We are again testing a composite hypothesis and should like to determine a family of
critical surfaces to be used as significance levels, dividing the points within the tetrahedron
in such a way that the chance of the sample point (a, b, c, d)* lying outside a given surface
$S_\epsilon$ is equal to $\epsilon$, whatever the values of the unknown probabilities p(A) and p(B). But
again, as in Problem II, owing to the discontinuity in the set of points, there are no ‘similar

* In view of the condition (30), the point can be defined by three co-ordinates, e.g. as (a, b, c),
(a, b, m) or (a, r, m). In view of the form of equation (32), the last system of co-ordinates will be used.
regions’. We note that if $H_0$ is true, the probability of the observed result is a term of the
multinomial expansion, viz.

$$\frac{N!}{a!\,b!\,c!\,d!}\,\{p(A)\}^{a+b}\{1 - p(A)\}^{c+d}\{p(B)\}^{a+c}\{1 - p(B)\}^{b+d}$$

$$= \frac{N!}{m!\,n!}\,\{p(B)\}^m\{1 - p(B)\}^n \times \frac{N!}{r!\,s!}\,\{p(A)\}^r\{1 - p(A)\}^s \times \frac{m!\,n!\,r!\,s!}{a!\,b!\,c!\,d!\,N!}$$

$$= P_2\{m \mid p(B), N\} \times P_2\{r \mid p(A), N\} \times P_1\{a \mid N, r, m\}. \tag{32}$$

Here, the notation of para. 30 has been repeated.

48. Thus the probability of obtaining a sample represented by the triplet (a, r, m) may be regarded, if the characters A and B are independent, as the product of three terms:

(i) The probability of drawing m individuals with character B in a random sample of N, i.e. the probability that (a, r, m) falls in a horizontal section of the tetrahedron on which m = constant. This is the (m + 1)th term in the expansion of the binomial

{(1 − p(B)) + p(B)}^N.

(ii) The probability of drawing r individuals with character A in a random sample of N, i.e. the probability that (a, r, m) falls on the vertical section of the tetrahedron on which r = constant. This is the (r + 1)th term in the expansion of

{(1 − p(A)) + p(A)}^N.

(iii) The probability, given m and r, of the observed partition within the 2 × 2 table. This term represents the relative probability associated with the points lying along a straight line m = constant, r = constant; it is, of course, the same expression as has arisen in Problems I and II and is proportional to a term in the hypergeometric series F(−r, −m, n − r + 1, 1).
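
The three-term factorization described in (i) to (iii) can be checked numerically. The following is a minimal sketch in Python and is not part of the original paper; the function names are illustrative, and the cell probabilities assume the reading in which a + b = r individuals have character A and a + c = m have character B.

```python
from math import comb

def binom_term(k, n_trials, p):
    """The (k + 1)th term of the expansion of {(1 - p) + p}^n_trials."""
    return comb(n_trials, k) * p**k * (1 - p) ** (n_trials - k)

def hypergeom_term(a, N, r, m):
    """P1{a | N, r, m} = m! n! r! s! / (N! a! b! c! d!), written with binomial coefficients."""
    return comb(m, a) * comb(N - m, r - a) / comb(N, r)

def multinomial_term(a, b, c, d, pA, pB):
    """Probability of the cell counts (a, b, c, d) when A and B are independent."""
    N = a + b + c + d
    coeff = comb(N, a) * comb(N - a, b) * comb(N - a - b, c)  # N! / (a! b! c! d!)
    return (coeff * pA ** (a + b) * (1 - pA) ** (c + d)
                  * pB ** (a + c) * (1 - pB) ** (b + d))

# Illustrative values only: any admissible table and any p(A), p(B) will do.
a, b, c, d = 15, 5, 3, 7
N, m, r = a + b + c + d, a + c, a + b
lhs = multinomial_term(a, b, c, d, pA=0.6, pB=0.55)
rhs = binom_term(m, N, 0.55) * binom_term(r, N, 0.6) * hypergeom_term(a, N, r, m)
print(lhs, rhs)
```

The two printed values should agree to rounding error, whatever values of p(A) and p(B) are supplied, which is the point of the decomposition: only the third factor is free of the unknown probabilities.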

49. We are faced with a situation similar to that met under Problem II. Were it possible to cut off from each line on which m = constant, r = constant, a group of points such that

\[ \sum_a \big[P_1\{a \mid N, r, m\}\big] = \varepsilon, \tag{33} \]

then the subset of points within the tetrahedron composed of the sum of these groups for all possible combinations of m and r would have the property required of a 'critical region' in a significance test; i.e. the chance that the point (a, r, m) is included in the region, if H_0 is true, would be ε whatever values the irrelevant probabilities p(A) and p(B) assumed. However, (33) cannot be satisfied in general, and all that is possible is to define a family of significance contours such that the chance of a sample point falling beyond any one of them, say S_ε, is ≤ ε. By using the normal approximation to the sum of the hypergeometric tail-terms with the correction for continuity as described in paras. 35-39 for Problem II, we shall be very much on the safe side, i.e. the formal level of ε is likely to be much above the true chance of falling beyond the level, whatever be p(A) or p(B). The presence of the two binomial terms in equation (32) instead of the single term in equation (8-3), makes it likely that the overestimation of ε will be greater in Problem III than in II. It is to be expected, therefore, that at any rate when none of m, n, r or s is too small, the better approximation will be obtained by referring the u of equation (22) to the normal probability scale.

50. The handling of Problem III is discussed briefly by Barnard on p. 136 above. There is clearly room for further investigation. The general nature of the approximation involved is of course that which arises in every χ² test for goodness of fit or for independence in an h × k table, where we replace a distribution consisting of a finite set of probabilities at discrete points in multiple space by a continuous distribution for which integration outside ellipsoidal contours is straightforward.

(ix) General comment

51. The duties of the statistician lie at many levels. He may be required merely to apply an established technique of analysis to an assembly of numerical data and this application may result in a statement, based on probability theory, of a 'level of significance' or a 'confidence interval', which will be used by others. Or he may be called on to share in planning the investigation or experiment which is to provide the data and then to draw conclusions from their analysis which will lead to further action. In this final role he needs to bring into play faculties which are no monopoly of his calling, the qualities of sound judgement which are the characteristics of a well trained, scientific mind. In the weighing of evidence, the result of the statistical analysis, expressed in one or more conventional probability figures, is only one factor in the summing up; as important, may be, is the question of whether the mathematical model is a fair counterpart to the happenings in the observational field. In addition, there will often be much information coming from outside the range of the immediate investigation, yet hardly expressible in numerical terms, which must influence decision.

52. It is perhaps hard experience gained in certain fields of war-time research, where 
decisions had to be reached on statistical data far less ample than could be wished, which has 
forced my own attention to this question: What weight do we actually give to the precise 
value of a probability measure when reaching decisions of first importance ? One subject for 
examination falling under this inquiry is clearly the logical basis of the reasoning process 
by which judgement is influenced as a result of the application of a test of significance. This 
was the theme on which this paper opened. The approach illustrated in the pages which 
followed is a personal one and is set down, with no claim to be the best, in order to provoke 
thought and discussion. There appears no short route to a right answer in this matter; each 
individual who hopes to use his own judgement to the full in drawing conclusions from the 
statistical analysis of sampling data, must decide for himself what he requires of probability 
theory. 

53. In the approach which I have followed and illustrated on the analysis of data classed in a 2 × 2 table, the appropriate probability set-up is defined by the nature of the random process actually used in the collection of the data. Consideration of this point forms the initial step in the determination of the appropriate test. On this score, what I have termed Problems I, II and III are differentiated. The difference is fundamental and lies at the bottom of the dilemma to which the Barnard-Fisher correspondence in Nature drew attention. It can be illustrated on the following data, given in Table 7, where I shall suppose that the effect we are interested in is that making a significantly greater than b.

54. If (a) the results have been obtained by random assignment of Treatment 1 to eighteen out of thirty individuals and Treatment 2 to the remaining twelve, and

(b) we merely ask whether the results are consistent with the hypothesis that the treatments are equivalent as far as these thirty individuals are concerned, so that the difference between the proportions 15/18 and 5/12 may reasonably be ascribed to a chance fluctuation,

(c) we are then concerned with Problem I, i.e. simply with the probabilities associated with the points (a, 20 − a) on the diagonal r = 20 of Fig. 4. The chance of getting a ≥ 15, if the null hypothesis is true, is 0.0241,* or, using a common phrase, we can speak of the result being significant at the 2.5 % level.
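
The Problem I calculation for these data can be reproduced directly. The sketch below is an illustration added here, not the author's computing procedure; it takes the data as a = 15, m = 18, n = 12, r = 20 (so N = 30), sums the hypergeometric terms exactly, and also forms the normal deviate with the ½ correction for continuity. The function names are hypothetical.

```python
from math import comb, erfc, sqrt

def upper_tail_exact(a_obs, N, r, m):
    """Exact chance that a >= a_obs on the line r = constant, m = constant."""
    a_max = min(r, m)
    total = sum(comb(m, a) * comb(N - m, r - a) for a in range(a_obs, a_max + 1))
    return total / comb(N, r)

def upper_tail_normal(a_obs, N, r, m):
    """Normal approximation with the 1/2 continuity correction.

    Returns the deviate and the area under the normal curve beyond it.
    """
    n, s = N - m, N - r
    mean = r * m / N
    sd = sqrt(m * n * r * s / (N**2 * (N - 1)))
    z = (a_obs - 0.5 - mean) / sd
    return z, 0.5 * erfc(z / sqrt(2))

p_exact = upper_tail_exact(15, 30, 20, 18)
z, p_norm = upper_tail_normal(15, 30, 20, 18)
print(round(p_exact, 4), round(z, 3), round(p_norm, 3))
```

Run as written, the exact tail should come out close to the figure quoted in the text, and the deviate should agree with the value 1.943 given in the footnote.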

55. On the other hand, if a sample of 18 has been drawn randomly from one population and a sample of 12 independently from a second and we wish to test whether p_1(A) = p_2(A), then it seems to be an artificial procedure to restrict the experimental probability set to the 11 points on the line r = 20, i.e. to the values of a: 8, 9, ..., 18. A repetition of the double sampling process could give us a result (a, b) falling at any of the 19 × 13 = 247 points in the lattice diagram of Fig. 4. There will be a number of ways of defining a family of significance levels for this 2-dimensioned set; if we adopt that discussed in paras. 40-41, which gives as two of its members the dotted, stepped lines shown in Fig. 4, we can say that the chance of a result falling beyond the lower line is certainly less than 0.015.† The observed point, with a = 15, b = 5, falls beyond the line, so that the result is undoubtedly 'significant at the 1.5 % level'.


Table 7

                                                  Frequency of results
For Problem I    For Problem II                   A          not-A      Total

1st treatment    Sample from 1st population       a = 15     c = 3      m = 18
2nd treatment    Sample from 2nd population       b = 5      d = 7      n = 12

Total                                             r = 20     s = 10     N = 30


56. These two probabilities, 2.5 and 1.5 %, are not the same, but there is no inconsistency in their difference. The character of the two investigations is different and to treat Problem II as though it were Problem I seems to call for a probability set-up which is unnecessarily artificial, when a simpler one is available. Admittedly by getting what seems to me a closer relation between the probability set-up and the experimental procedure, we have sacrificed some simplicity in handling the 2 × 2 table. But this is only the case when dealing with small numbers. For large numbers the methods of handling Problems I, II and III become, practically, identical.

57. Consider again the heavy shell problem described in para. 7 above. If we are to introduce probability theory, it seems to me that we should regard the problem as one in which we have a sample of m = 12 from the possible output of shell made to one design or by one firm and of n = 8 from the possible output of a second. This sampling may be hypothetical in that these may be 'pilot' shell, the first off production; nevertheless, this construct is clearly less artificial than one in which, on the null hypothesis, we regard the experiment as though it were made on twenty shells, to twelve of which has been randomly assigned the label 'Made by firm X' and to the other eight, 'Made by firm Y'.

* For the normal curve approximation, using the correction for continuity, we find

(15 − ½ − 12.0)/1.2866 = 1.943.

The proportionate area under the normal curve beyond this deviation is 0.026.

† Table 5, col. (6) shows the largest value of this chance to be 0.0120 for p = 0.3. This figure cannot be much exceeded for other values of p, though I have not determined the precise maximum. I give 0.015 as a safe-side limit.

58. It is clear that in the heavy shell problem there may be many reasons to doubt whether the rounds fired can be regarded as a random sample from future output. That is why I have emphasized that the exploration which the statistician makes in private will not necessarily be presented in figures at the conference table. In this example, the proportions of successful perforations were 2/12 and 5/8; these put us on the line, r = 7, of the lattice diagram for which the hypergeometric probabilities were shown in Fig. 2. The sum of the terms with a ≤ 2 is 5.2 % (normal approximation, using the ½-correction, 5.6 %). This is the chance of getting as great or a greater positive difference, b − a, if H_0 were true, treating the case as Problem I. Barnard's method has not yet been extended to cover this case, but if we were to use the large sample method for handling Problem II, described in my paras. 40-41, we should find from equation (22) that

u = (2 − 4.2)/1.072 = −2.05,

which puts (a, b) outside the upper 2.5 % level.
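
The figures for the heavy shell example can be recomputed as follows. This is a sketch only: equation (22) is not reproduced in this part of the paper, so it is assumed here to have the form u = (a − rm/N)/√{mnrs/(N²(N − 1))}, which gives the denominator 1.072 quoted above. The data are taken as a = 2, m = 12, n = 8, r = 7, and the function names are hypothetical.

```python
from math import comb, sqrt

def u_statistic(a, m, n, r):
    """Ratio u under the assumed form of equation (22)."""
    N = m + n
    s = N - r
    return (a - r * m / N) / sqrt(m * n * r * s / (N**2 * (N - 1)))

def lower_tail_exact(a_obs, N, r, m):
    """Exact hypergeometric chance that a <= a_obs (Problem I)."""
    a_min = max(0, r - (N - m))
    total = sum(comb(m, a) * comb(N - m, r - a) for a in range(a_min, a_obs + 1))
    return total / comb(N, r)

print(round(u_statistic(2, 12, 8, 7), 2))         # deviate referred to the normal scale
print(round(lower_tail_exact(2, 20, 7, 12), 3))   # sum of the terms with a <= 2
```

The exact tail is 4040/77520, or about 5.2 %, so that the two methods fall on opposite sides of the 5 % level, as the text goes on to discuss.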

59. Were the action taken to be decided automatically by the side of the 5 % level on
which the observation point fell, it is clear that the method of analysis used would here be 
of vital importance. But no responsible statistician, faced with an investigation of this 
character, would follow an automatic probability rule. The result of either approach would 
raise considerable doubts as to whether the performance of the first type of shell was as good 
as that of the second, but without the whole background of the investigation it is impossible 
to say what the statistician’s recommendation as to further action would be. 

60. In the example of the proof of anti-tank shot discussed in para. 6, the chance of perforation, p, while varying from plate to plate and batch to batch, will almost certainly not range through the whole interval 0-1. The striking-velocity of the shot would also probably be adjusted so that for average proof-plate and batches, p was near ½. Then the discriminating level (or levels*) set across the 13 × 13 lattice diagram would be fixed paying regard to the likely variation in p; thus a fairly close upper limit could be calculated to the true probability of (a, b) falling beyond the level if the fresh batch were of the same quality as the standard. This is the upper limit of the risk of segregating the batch wrongly.

61. Precisely similar problems arise for consideration in even more difficult form in the analysis of data arranged in an h × k table, where h or k or both are > 2. It has become common practice to speak of the solution of this problem in terms of 'fixed marginal totals', but it may be questioned whether the restriction in the experimental probability set implied is generally appropriate. The frequencies in an h × k table may have been obtained by many different sampling procedures for, as in the 2 × 2 problem, a single form of tabular presentation will follow from a variety of types of investigation. For most of these, a repetition of the random process of selection would give results with either one or both sets of marginal totals changed.

62. For convenience in solution we may, of course, start by considering the distribution of our test criterion, on the null hypothesis, within the sub-set of results for which the margins are fixed. If this distribution were the same whatever these fixed values, then the overall distribution for unrestricted sampling would be the same as that for variation subject to fixed margins. Thus, mathematically, the solution of the partial problem would be a step in the solution of the complete one. But when applying analysis to an h × k table, this result is only true as a large-sample approximation.

* It is possible that two levels might be taken with the associated proof rules: (i) if (a, b) falls beyond the outer one, reject the batch; (ii) if between outer and inner, fire further rounds; (iii) if within the inner level, accept the batch.

63. If we use the mathematical model which it is suggested gives the most direct aid in reasoning from the observations, i.e. that which regards the experimental probability set as generated by a repetition of the random process of selection used in collecting the data, then in the majority of cases we cannot regard the marginal totals as fixed. Thus a rigorous treatment would lead, as in the case of the 2 × 2 table, to a differentiation into a number of solutions. It is to be hoped, however,* unless the numbers in the margins are very small, that the χ² approximation with its appropriate degrees of freedom† gives results which are not misleading. This approximation leads, of course, in the 2 × 2 table to the reference of the ratio u of equation (22) to the normal probability scale. Some aspects of the approximation in this more general case were discussed by Yates (1934, pp. 233-35).

64. In closing I should like again to acknowledge my indebtedness to Mr G. A. Barnard. Having had the good fortune to discuss these problems with him and see drafts of his work over a period of 2 or 3 years it is difficult to say how many of his ideas have been built unconsciously into my own earlier approach. But I am especially aware of the clarification which his emphasis on the distinction between Problems I, II and III brought to my survey. I am also very grateful to Mr M. G. Kendall, Dr R. C. Geary and Dr B. L. Welch for a number of helpful criticisms, and to Mrs Maxine Merrington for her extensive computing work, which has alone made possible the various numerical illustrations that I have given.

* From the point of view both of the exponents of the fixed marginal and unrestricted marginal 
approach. 

† The statement that, for example, in applying the χ² test of independence of two characters to an h × k table, the degrees of freedom are (h − 1) × (k − 1), does not of course mean that sampling is restricted by fixed marginal totals. All that is implied is that approximately the overall distribution of the function of the observations used, χ², is the same as that for sampling within the restricted sub-set; this is because the distribution within each sub-set is approximately independent of the particular marginal totals which define it.


REFERENCES

Barnard, G. A. (1945a). Nature, Lond., 156, 177.
Barnard, G. A. (1945b). Nature, Lond., 156, 783.
Fisher, R. A. (1935). J. Roy. Statist. Soc. 98, 39.
Fisher, R. A. (1941). Science, 94, 210.
Fisher, R. A. (1945a). Nature, Lond., 156, 388.
Fisher, R. A. (1945b). Sankhyā, 1, 130.
Kendall, M. G. (1943). Advanced Theory of Statistics, 1. London: Charles Griffin and Co. Ltd.
Neyman, J. & Pearson, E. S. (1928a). Biometrika, 20 A, 196.
Neyman, J. & Pearson, E. S. (1928b). Biometrika, 20 A, 263.
Neyman, J. & Pearson, E. S. (1933). Philos. Trans. A, 231, 289.
Neyman, J. & Pearson, E. S. (1936). Statist. Res. Mem. 1, 113.
Neyman, J. & Pearson, E. S. (1938). Statist. Res. Mem. 2, 26.
Pearson, K. (1899). Phil. Mag. 47, 236.
Przyborowski, J. & Wilenski, H. (1939). Biometrika, 13, 313.
Wilson, E. B. (1941). Science, 93, 657.
Wilson, E. B. (1942). Proc. Nat. Acad. Sci., Wash., 28, 94.
Yates, F. (1934). J. Roy. Statist. Soc. Suppl. 1, 217.




APPENDIX 


THE NORMAL CURVE APPROXIMATION IN PROBLEM I


1. The following Tables 8 and 9 (A), (B) and (C) show the order of accuracy which results from using the normal curve integral as an approximation to the tail sums in the series

\[ P_1\{a \mid N, r, m\} = \frac{m!\,n!\,r!\,s!}{a!\,b!\,c!\,d!\,N!}, \tag{34} \]

the terms of which are proportional to those in the hypergeometric series

F(−r, −m, N − m − r + 1, 1).

Here a is a variable which can assume the range of positive, integral values indicated under (i), (ii) and (iii) in para. 20 above, while N, r and m are fixed. The relation between these quantities and b, c, d, n and s is given in Table 1, para. 17. The method of approximation, using the correction for continuity, has been discussed in para. 26.

2. Table 8 takes the case of an equal partition, m = n = \N, and shows the sum of the 
terms in the expression (34) for which which is also the sum of terms for which 

For results are given in Table 9 for m>n and for the following pro¬ 

portionate partitions of N ; 

(A) m=fiV, (B) m=%N, n = ^N; (C) m=*V, n=-^N. 

Here sums of terms at both tails of the series are needed. The sums (or chances of 
or < a^) have not been given for all possible values of but, broadly speaking, for those 
within the limits where significance is likely to be in question. Sums below 0.0010 have
generally been omitted. In each case the true sum of the terms (34) is compared with the 
approximation from the normal integral. 


3. In drawing conclusions from the comparison, we have to decide what degree of 
accuracy is called for. Clearly the normal integral does not give mathematically exact 
results to 4 decimal places. On the other hand, except for certain instances where the 
partition is very unequal (m = and ^N) and r is small, the order of the approximation 
may be said to follow that of the series closely. If decisions are made by rule of thumb, 
according to the side of the 5 % or 1 % significance level on which a falls, then there are 
a number of entries in the tables where the approximation would give a on the wrong side. 
But one may question whether judgement of significance based on a single experiment can 
in fact be made sensitive to a difference between, say, 0.06 and 0.04 (odds of 16 to 1 and
24 to 1) or between 0.012 and 0.008 (odds of 82 to 1 and 124 to 1) and, given such latitude
in accuracy, the approximation will be found generally sufficient. These must be points,
however, where personal opinions will differ. Whatever views are held, the tables are 
sufficiently extensive to make it possible to obtain from them a rough measure of the 
accuracy of approximation in a wide range of cases. 

4. It will be noted that in the symmetrical case (w = ^N) and also when m the 
normal approximation for the tail sum is almost invariably a little too large. Undoubtedly 
for the symmetrical case an improved approximation could be obtained by modifying 
the ½ correction used in calculating the ratio of deviation to standard deviation. This
second order term would, however, need to vary with the probability level, thus
complicating the procedure.
























































Table 9. Case of unequal partition. Chances that a^a^ and a'^ai 

(A) m = ⅗N, n = ⅖N

Partition: m = 60, n = 40; m = 36, n = 24; m = 24, n = 16; m = 18, n = 12; m = 12, n = 8



0-0020 0-0019 

•0074 *0074 0*0016 0*0020 
•0230 0231 -0084 -0093 


-1323 -1339 

•0680 -0604 

•0209 -0231 

•0061 -0074 

•0014 -OOIO 


0-0027 
•0114 1 
•0381 
•1019 
•2211 

4 0-2236 I 

5 -0994 

6 -0341 
•0086 
•0016 


0-0063 0-0063 
‘0236 -0233 

•0776 -0775 

•1948 -1968 


1 

True 

i 

Normal 

approx. 

0*0023 

•0270 

•1311 

O'OOBO 

•0329 

•1348 

0-1322 

•0318 

•0046 

0-1348 

•0329 

•0050 

0-0016 

•0112 

•0526 

•1666 

0‘0021 

•0128 

*0656 

•1696 

0-1666 

•0626 

•0112 

•0016 

0*1696 

•0666 

•0128 

•0021 


0-0016 

•0106 


Normal 

approx. 


Normal 

approx. 


0-0611 0-0616 

0-0678 0-0616 
•0036 -0051 


• 9 
10 


0-0006 

0-0010 

•0109 

•0118 

•0766 

•0770 

0-1341 

0-1378 

•0186 

•0269 


B 


0.-0690 0-0781 


0-0026 

•0241 

•1182 

0-0033 

•0260 

‘1218 

0-1170 

•P209 

•0016 

0-1218 

•0260 

•0033 

0-0004 

•0086 

■0681 

0-0007 

•0097 

•0704 

0-1250 

•0168 

0-1300 

•0232 

0-0066 

■0681 

0-0070 

•0701 

00601 

0-()701

Table 9 (continued)

(B) w = fZV,» = JIV


(C) m n «

2 × 2 TABLES. A NOTE ON E. S. PEARSON'S PAPER

By G. A. BARNARD


As Prof. Pearson has kindly shown me the proof of his paper, I should like to make the following further remarks.

1. If we have a sample of N from a population in which there is a chance p that an individual will have a character A, we can represent it in the form

\[ (x_1, x_2, \ldots, x_N), \]

where x_i is 1 or 0 according as to whether the ith member has A or not.* Regarding the x's as quantitative variables, we have by classical results the unbiased estimates

\[ \hat{p} = \Big(\sum x_i\Big)\Big/N \quad\text{and}\quad \hat{\sigma}^2 = \Big\{\sum (x_i - x_{\cdot})^2\Big\}\Big/(N-1). \]

If r of the x_i are 1, while s are 0, we find

\[ \hat{p} = \frac{r}{N} \quad\text{and}\quad \hat{\sigma}^2 = \frac{rs}{N(N-1)}. \]

Using this unbiased estimate of variance in Prof. Pearson's para. 43, we get, in place of his expression,

\[ \frac{d}{\sigma_d} = \frac{a - rm/N}{\sqrt{\dfrac{mnrs}{N^2(N-1)}}}, \tag{1} \]

agreeing exactly with his (22).
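
The step from the 0/1 representation to the variance estimate rs/{N(N − 1)} is easily verified. The sketch below uses exact rational arithmetic; it is an illustration only, with hypothetical function names, and the values r = 20, s = 10 are chosen merely as an example.

```python
from fractions import Fraction

def unbiased_variance(xs):
    """Classical unbiased estimate: sum of squared deviations divided by N - 1."""
    N = len(xs)
    mean = Fraction(sum(xs), N)
    return sum((x - mean) ** 2 for x in xs) / (N - 1)

def closed_form(r, s):
    """rs / {N(N - 1)} for a sample with r ones and s zeros."""
    N = r + s
    return Fraction(r * s, N * (N - 1))

sample = [1] * 20 + [0] * 10          # r = 20 individuals with A, s = 10 without
print(unbiased_variance(sample), closed_form(20, 10))
```

Both expressions print the same fraction, since the sum of squared deviations of r ones and s zeros about their mean r/N is rs/N.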

2. To carry the argument further, in classical theory, if we have two samples

(x_1, x_2, ..., x_m) and (y_1, y_2, ..., y_n)

to test whether the samples come from the same normal population we take

\[ t = \frac{x_{\cdot} - y_{\cdot}}{s}\sqrt{\frac{mn}{m+n}}, \]

where x. = (Σx_i)/m, y. = (Σy_i)/n, and

\[ s^2 = \frac{\sum (x_i - x_{\cdot})^2 + \sum (y_i - y_{\cdot})^2}{m+n-2}, \tag{2} \]

and use tables of the t distribution for (m + n − 2) degrees of freedom.

It is common practice to neglect departures from normality in applying this test. If we do so, and apply it to our qualitative case along the lines indicated above, we get

\[ t = \frac{a - rm/N}{\sqrt{\dfrac{acn + bdm}{N(N-2)}}}, \]

which, if we are justified in our neglect of departures from normality, should be distributed as t on (N − 2) degrees of freedom.
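
That the ordinary two-sample t, applied to indicator variables, reduces to the expression in a, b, c, d can be confirmed numerically. The following sketch is illustrative and not part of the note; the function names are hypothetical, and the cell counts are those of Prof. Pearson's Table 7 read as a = 15, b = 5, c = 3, d = 7.

```python
from math import sqrt

def pooled_t(xs, ys):
    """Classical two-sample t with the pooled variance estimate (2)."""
    m, n = len(xs), len(ys)
    xbar, ybar = sum(xs) / m, sum(ys) / n
    ss = sum((x - xbar) ** 2 for x in xs) + sum((y - ybar) ** 2 for y in ys)
    s2 = ss / (m + n - 2)
    return (xbar - ybar) / sqrt(s2 * (1 / m + 1 / n))

def t_from_table(a, b, c, d):
    """The same quantity written in terms of the 2 x 2 table."""
    m, n = a + c, b + d
    N, r = m + n, a + b
    return (a - r * m / N) / sqrt((a * c * n + b * d * m) / (N * (N - 2)))

a, b, c, d = 15, 5, 3, 7
xs = [1] * a + [0] * c     # first sample: a with the character, c without
ys = [1] * b + [0] * d     # second sample: b with the character, d without
print(pooled_t(xs, ys), t_from_table(a, b, c, d))
```

The two values agree to rounding error; whether the t distribution on (N − 2) degrees of freedom is an adequate reference for them is the separate question taken up in the paragraphs that follow.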


* For a similar argument see B. L. Welch (1938, p. 155).

3. To obtain the formula (1) on these lines, we have in effect to commit the well-known fallacy of replacing s², as given by (2), by

\[ s'^2 = \frac{\sum (x_i - m')^2 + \sum (y_j - m')^2}{m+n-1}, \tag{3} \]

where m' = (Σx_i + Σy_j)/(m + n).

We are led to ask why (3) should be approximately correct (and in fact it is better than (2)) in the qualitative case, while (2) is preferred in the quantitative case.

4. The simplest reason for preferring (2) to (3) in the quantitative case is that s'² is not independent of (x. − y.), so that the conditions for validity of the t distribution are not satisfied. In our qualitative case this argument loses validity, since neither s² nor s'² is independent of (x. − y.).

The second reason for preferring (2) to (3) in the quantitative case is more complicated, 
but for our purposes it reduces essentially to the fact that, in the case of normal distributions, 
and only in this case, the mean and variance of samples are independently distributed, so 
that the common mean value of the populations, estimated by m', is irrelevant to the test 
for differences. In our qualitative case, on the other hand, m' contributes to our knowledge 
of the variance. 

5. If we apply Pitman's 'absolute' analogue of the t test to our case, we arrive at the hypergeometric series of Prof. Pearson's Problem I. But Bartlett's argument, showing the convergence of Pitman's test and the t test, will apply here only in very large samples, because of the finite probability of obtaining observed values which coincide.

6. From the above point of view, Prof. Pearson's analysis of his Problem II may be regarded in one sense as an examination of the effect of large departures from normality on the t test. In this light, his conclusions given in paras. 51 and 52 are seen to extend to the t test, as well as to the 2 × 2 table problem.

7. If I may state my personal attitude, it is that statistics is a branch of applied 
mathematics, like symbolic logic or hydrodynamics. Examination of foundations is
desirable, but it must be remembered that undue emphasis on niceties is a disease to which 
persons with mathematical training are specially prone. In pure mathematics itself there are 
disputes on foundations which closely parallel the disputes over the foundations of statistics. 
The lesson to be drawn is, that while statistics is a most valuable aid to judgement, it cannot 
wholly replace it. 

8. Finally, it must be emphasized that the order of printing of Prof. Pearson's paper and
my own reflects Prof. Pearson's generosity rather than the historical order of events. Much
of his paper was, unknown to me, given in lectures before the war; whereas my work on the 
problem began only in 1943. Since then I have owed much both to Prof. Pearson’s published 
work and to discussions which I have been privileged to have with him. 


REFERENCE

Welch, B. L. (1938). Biometrika, 30, 15S.

THE CUMULANTS OF THE Z AND OF THE LOGARITHMIC χ² AND t DISTRIBUTIONS

By JOHN WISHART
School of Agriculture, Cambridge

Explicit expressions for the exact cumulants of Fisher's z-distribution do not appear ever to have been published. They were therefore worked out, and appear in § 2 of this paper. It afterwards appeared that the logical method of presentation was to deal with the similar problem for ½ log(χ²/n),* since the z-distribution involves the simple difference of two such functions which are independent. This led to § 1. Since writing this paper, Bartlett & Kendall (1946) have published the same result in the form of the cumulants of log s², and have given graphical and tabular representations for varying n up to 20. The solution is, of course, implicit in Cornish & Fisher's (1937) statement of the moment generating function, while Mr C. R. Rao has informed me that he reached the same result in work done for an M.A. Thesis of the University of Calcutta (unpublished). § 1 has accordingly been shortened, but is retained in view of the additional formulae to those of Bartlett and Kendall.


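1. The logarithmic χ² distribution

The distribution of χ², for n degrees of freedom, is given by

\[ dF = \frac{1}{2^{\frac12 n}\,\Gamma(\tfrac12 n)}\,(\chi^2)^{\frac12 n - 1} e^{-\frac12\chi^2}\, d(\chi^2). \]

As pointed out by Cornish & Fisher (1937), the mean value of exp{½ it log(χ²/n)},

i.e. of exp{½ it log(½χ²) − ½ it log(½n)},

is the moment generating function of the distribution of ½ log(χ²/n), namely

\[ M = \exp\{-\tfrac12 it\log(\tfrac12 n)\}\,\frac{\Gamma\{\tfrac12(n+it)\}}{\Gamma(\tfrac12 n)}. \]

The cumulant generating function is

\[ K = \log M = -\tfrac12 it\log(\tfrac12 n) + \log\Gamma\{\tfrac12(n+it)\} - \log\Gamma(\tfrac12 n). \]

The cumulants of the distribution of ½ log(χ²/n) are readily written down by differentiating K successively with respect to it and at each stage putting t = 0. We have in fact

\[ \kappa_1 = -\tfrac12\log(\tfrac12 n) + \tfrac12\,\frac{d}{da}\log\Gamma(a)
 = -\tfrac12\Big\{\log a + \lim_{s\to 1}\Big(\zeta(s,a) - \frac{1}{s-1}\Big)\Big\}, \tag{1} \]

\[ \kappa_s = (-1)^s (s-1)!\,2^{-s}\,\zeta(s,a) \quad (s>1,\ a=\tfrac12 n), \]

where ζ(s, a) denotes the generalized Zeta-function

\[ \zeta(s,a) = \sum_{i=0}^{\infty}\frac{1}{(a+i)^s}. \]

* All logarithms in this paper are to base e.

The cumulants may be readily computed by throwing them into the form

\[ 2\kappa_1 = -\log(\tfrac12 n) + \psi(\tfrac12 n), \qquad 2^s\kappa_s = \psi^{(s-1)}(\tfrac12 n) \quad (s>1), \tag{2} \]

where ψ(x) = d{log Γ(x)}/dx, ψ'(x) = d²{log Γ(x)}/dx², etc.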

ψ(x) is variously called the Psi or Digamma function, and its derivatives have been called the Trigamma, Tetragamma, etc., Functions, and the series the Polygamma Functions. These functions have been computed in some considerable detail. For n up to 22 the mean and variance can be got from Elinor Pairman's 'Tables of the Digamma and Trigamma functions' (1919). Tables up to Pentagamma appear in Vol. I of the British Association's Mathematical Tables (1931), but with certain gaps which, although intended to be bridged by reduction formulae, render the tables less generally useful (for n less than 22) than H. T. Davis's Tables (1933, 1935). Table 10 of Vol. I gives all that is required for ψ(x); in Vol. II, Tables 14-16, 18-20, 22-24 and 26-28 cover a wide range up to Hexagamma.

As shown by Bartlett & Kendall (1946), the approach to normality is very slow. For n = 24 (the limit for n_1 of the z table of Fisher & Yates (1943), which provides percentage points for the distribution under consideration in the line n_2 = ∞) the cumulants have been worked out to κ_6, the last being specially computed from its formula given below. The gamma ratios are γ_1 = −0.296, γ_2 = 0.174, γ_3 = −0.164 and γ_4 = 0.176, and |γ| increases thereafter at this level of n instead of tending to zero. Approximate percentage points may, however, be worked out by using the formulae at the foot of the z table, putting n_2 = ∞.
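For small n, we note that

\[ \zeta(s) = \sum_{i=1}^{\infty}\frac{1}{i^s} = \frac{1}{1^s}+\frac{1}{2^s}+\frac{1}{3^s}+\dots+\frac{1}{r^s}+\dots \quad (r \text{ an integer}), \]

\[ \zeta(s)\,(1-2^{-s}) = \frac{1}{1^s}+\frac{1}{3^s}+\frac{1}{5^s}+\dots+\frac{1}{(2r-1)^s}+\dots \]

We thus get, for n = 2r,

\[ \kappa_s = (-1)^s (s-1)!\,2^{-s}\Big\{\zeta(s) - \Big(\frac{1}{1^s}+\frac{1}{2^s}+\dots+\frac{1}{(r-1)^s}\Big)\Big\} \quad (s>1), \tag{3} \]

in which the terms in {...} reduce to ζ(s) for n = 2.

For n = 2r + 1,

\[ \kappa_s = (-1)^s (s-1)!\Big\{\zeta(s)\,(1-2^{-s}) - \Big(\frac{1}{1^s}+\frac{1}{3^s}+\dots+\frac{1}{(2r-1)^s}\Big)\Big\} \quad (s>1), \tag{4} \]

in which the terms in {...} reduce to ζ(s)(1 − 2^{−s}) for n = 1.

In the special case of s = 1 we have

\[ n = 2r:\quad \kappa_1 = -\tfrac12(\gamma + \log\tfrac12 n) + \tfrac12\Big(1+\tfrac12+\tfrac13+\dots+\frac{1}{r-1}\Big), \tag{5} \]

\[ n = 2r+1:\quad \kappa_1 = -\tfrac12(\gamma + \log 2n) + \Big(1+\tfrac13+\tfrac15+\dots+\frac{1}{2r-1}\Big). \tag{6} \]

For n = 2 and 1 respectively these expressions reduce to the first bracket. ζ(s) can be got from tables, and in particular

\[ \zeta(2m) = \frac{(2\pi)^{2m}B_m}{2\,(2m)!}, \]

where the B's are the Bernoulli numbers B_1 = 1/6, B_2 = 1/30, B_3 = 1/42, etc.

γ is Euler's constant. For reference we may quote:

γ = 0.57721 56649,       ζ(4) = 1.08232 32337,

ζ(2) = 1.64493 40668,    ζ(5) = 1.03692 77551,

ζ(3) = 1.20205 69032,    ζ(6) = 1.01734 30620.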
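
The quoted constants, and the use of formula (3) for small even n, can be checked with a few lines of Python. This is a sketch added for illustration; it assumes the standard relation ζ(2m) = (2π)^{2m} B_m / {2 (2m)!} with the older numbering of the Bernoulli numbers (B_1 = 1/6, B_2 = 1/30, B_3 = 1/42), and the function names are hypothetical.

```python
from fractions import Fraction
from math import factorial, pi

# Bernoulli numbers in the older (all-positive) numbering assumed in the lead-in.
OLD_BERNOULLI = {1: Fraction(1, 6), 2: Fraction(1, 30), 3: Fraction(1, 42)}

def zeta_even(m):
    """zeta(2m) from the Bernoulli numbers."""
    return (2 * pi) ** (2 * m) * float(OLD_BERNOULLI[m]) / (2 * factorial(2 * m))

def kappa_even_n(s, r, zeta_s):
    """kappa_s of (1/2) log(chi^2 / n) for n = 2r and s > 1, following formula (3)."""
    partial = sum(j ** (-s) for j in range(1, r))   # 1 + 2^-s + ... + (r-1)^-s
    return (-1) ** s * factorial(s - 1) * 2.0 ** (-s) * (zeta_s - partial)

for m in (1, 2, 3):
    print(2 * m, round(zeta_even(m), 10))

# Variance of (1/2) log(chi^2 / n) for n = 6 (r = 3), from formula (3) with s = 2.
print(kappa_even_n(2, 3, zeta_even(1)))
```

For n = 6 the variance is ¼{ζ(2) − 1 − ¼}, so that no tables beyond the constants listed above are needed for small even n.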

Note that ==')' +Jo 

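For large n, asymptotic formulae for the Zeta-function may be used, and we get

\[ \kappa_1 = -\frac{1}{2n} - \sum_{j=1}^{\infty}\frac{(-4)^{j-1}B_j}{j\,n^{2j}}
 = -\frac{1}{2n}-\frac{1}{6n^2}+\frac{1}{15n^4}-\frac{8}{63n^6}+\frac{8}{15n^8}-\dots, \]

\[ \kappa_s = (-1)^s\Big\{\frac{(s-2)!}{2n^{s-1}}+\frac{(s-1)!}{2n^{s}}+2\sum_{j=1}^{\infty}\frac{(-4)^{j-1}B_j\,(2j+s-2)!}{(2j)!\;n^{2j+s-1}}\Big\} \quad (s>1). \]
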


We may note in passing that not only may this general expression for κ_s be applied to the special case s = 1 with the proviso that the first term in that case is dropped, but also that κ_2 may be obtained from κ_1 + ½ log(½n) by term-by-term differentiation with respect to n, and likewise κ_3 from κ_2, κ_4 from κ_3, etc., by similar term-by-term differentiation. This follows from a property of the Zeta-function. It is therefore not necessary to write down the explicit expressions for κ_2, etc., but we may note that their leading terms are (2n)^{-1}, −(2n²)^{-1}, n^{-3}, −3n^{-4}, 12n^{-5}, etc., so that the leading terms of γ_1 and γ_2 are −(2/n)^{1/2} and 4/n respectively,
while y^ is More exactly we have, writing 72 .' — n — 1, 




1 7 

-1-L Q 


(i)) 


with corresponding expressions for Xg, etc., obtained by differentiation with respect to 


72 




7r-l 


iS)' 


Finally, if instead of the distribution of ½ log(χ²/n) we are interested in the distribution of log(s²), where s² is an estimate of σ² based on n degrees of freedom, we have

log(χ²/n) = log s² − log σ²,

and thus for the distribution of log(s²) we have

\[ \kappa_s = (-1)^s (s-1)!\,\zeta(s,a) \quad (s>1,\ a=\tfrac12 n), \]

while the γ ratios are the same as for ½ log(χ²/n). Obviously log χ and log s can be treated similarly. See Bartlett & Kendall (1946).

2. The z distribution

The distribution of z = ½ log(s_1²/s_2²), where s_1² and s_2² are independent estimates of a variance based respectively on ν_1 and ν_2 degrees of freedom, is obviously that of

½ log(χ_1²/ν_1) − ½ log(χ_2²/ν_2),

and its cumulants may therefore be at once derived from those of the logarithmic χ² distribution. The cumulant generating function is

\[ K = \log M = \tfrac12 it\log(\nu_2/\nu_1) + \log\Gamma\{\tfrac12(\nu_1+it)\} + \log\Gamma\{\tfrac12(\nu_2-it)\} - \log\Gamma(\tfrac12\nu_1) - \log\Gamma(\tfrac12\nu_2). \]

Further, we have

\[ \kappa_s = 2^{-s}(s-1)!\,\{\zeta(s,a_2) + (-1)^s\,\zeta(s,a_1)\} \quad (s>1,\ a_1=\tfrac12\nu_1,\ a_2=\tfrac12\nu_2). \]

For computing purposes these may be thrown into the forms

\[ 2\kappa_1 = \log(\nu_2/\nu_1) + \psi(\tfrac12\nu_1) - \psi(\tfrac12\nu_2), \]

\[ 2^s\kappa_s = \psi^{(s-1)}(\tfrac12\nu_1) + (-1)^s\,\psi^{(s-1)}(\tfrac12\nu_2) \quad (s>1). \tag{8} \]

To illustrate, let us take $\nu_1 = 24$, $\nu_2 = 60$. We then have from the Polygamma tables (except for $\kappa_6$, which was specially computed):

\[ \kappa_1 = -0.0127\,429, \qquad \kappa_3 = -0.0007\,998, \qquad \kappa_5 = -0.0000\,104, \]

\[ \kappa_2 = 0.0301\,992, \qquad \kappa_4 = 0.0000\,867, \qquad \kappa_6 = 0.0000\,019, \]

\[ \sigma = \sqrt{\kappa_2} = 0.1737\,792, \]

$\gamma_1 = -0.152$, $\gamma_2 = 0.095$ (or $\beta_1 = 0.023$, $\beta_2 = 3.095$), indicating the degree and nature of the departure from normality. $\gamma_3$ and $\gamma_4$ are $-0.066$ and $0.067$ respectively.
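The figures of this example can be reproduced without tables. The following is only a sketch: it assumes the Zeta-function form $\kappa_s = 2^{-s}(s-1)!\{\zeta(s, \tfrac12\nu_2) + (-1)^s\zeta(s, \tfrac12\nu_1)\}$ for $s > 1$ and the digamma form of $2\kappa_1$, and it evaluates $\psi$ and $\zeta(s, a)$ with simple helper routines (a recurrence plus an asymptotic tail) that are illustrative and not part of the paper.

```python
import math

def digamma(x):
    """psi(x), x > 0: push x above 10 by the recurrence, then use the asymptotic series."""
    acc = 0.0
    while x < 10.0:
        acc -= 1.0 / x
        x += 1.0
    u = 1.0 / (x * x)
    return acc + math.log(x) - 0.5 / x - u * (1 / 12 - u * (1 / 120 - u / 252))

def hurwitz_zeta(s, a, n_direct=20):
    """zeta(s, a) for integer s >= 2, a > 0: direct sum plus Euler-Maclaurin tail."""
    head = sum((a + k) ** (-s) for k in range(n_direct))
    b = a + n_direct
    tail = b ** (1 - s) / (s - 1) + 0.5 * b ** (-s)
    tail += s * b ** (-s - 1) / 12                         # B_2 / 2! term
    tail -= s * (s + 1) * (s + 2) * b ** (-s - 3) / 720    # B_4 / 4! term
    return head + tail

def z_cumulant(s, nu1, nu2):
    """s-th cumulant of z = (1/2) log(s1^2 / s2^2) on nu1 and nu2 degrees of freedom."""
    a1, a2 = nu1 / 2, nu2 / 2
    if s == 1:
        return 0.5 * (math.log(nu2 / nu1) + digamma(a1) - digamma(a2))
    return 2.0 ** (-s) * math.factorial(s - 1) * (
        hurwitz_zeta(s, a2) + (-1) ** s * hurwitz_zeta(s, a1))

k1, k2, k3, k4 = (z_cumulant(s, 24, 60) for s in (1, 2, 3, 4))
gamma1 = k3 / k2 ** 1.5
gamma2 = k4 / k2 ** 2
print(k1, k2, k3, k4, gamma1, gamma2)
```

The direct sum over twenty terms keeps the truncated tail far below the seventh decimal place, so no special care is needed for degrees of freedom of this size.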

If as a first approximation we assume that for $\nu_1$ and $\nu_2$ of the order of the numbers chosen in this example, or higher, $z$ is distributed normally with mean and variance given by the above formulae, we obtain approximate percentage points; e.g. for the 95 and 5 % points we can subtract and add $1.6449\sigma$ from and to the mean. The result in the present case is to give us $-0.299$ and $0.273$, the correct values being $-0.306$ and $0.265$. The approximation is adequate to almost two-figure accuracy, and is evidently useful when we only require to know whether an observed $z$ is significant or not. A better approximation is provided by the formulae attached to the z tables (see Fisher & Yates (1943)), which yield $-0.3046$ and $0.2653$ as against the correct values of $-0.3055$ (see Thompson (1941)) and $0.2654$.
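The normal approximation just described amounts to a two-line calculation. The sketch below simply takes the mean and variance quoted for $\nu_1 = 24$, $\nu_2 = 60$ and applies the deviate 1.6449; the variable names are illustrative.

```python
import math

mean = -0.0127429               # kappa_1 quoted in the text for nu1 = 24, nu2 = 60
sigma = math.sqrt(0.0301992)    # square root of the quoted kappa_2
u = 1.6449                      # normal deviate for a 5 % tail

lower = mean - u * sigma
upper = mean + u * sigma
print(round(lower, 3), round(upper, 3))   # prints -0.299 0.273
```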

Explicit algebraic expressions are readily written down for the cumulants for small $\nu_1$ and $\nu_2$, using the same method as for the logarithmic distribution. Where it is necessary to do so, $\nu_1$ will be assumed less than $\nu_2$. In the contrary case we need only interchange $\nu_1$ and $\nu_2$, changing the sign of the odd cumulants in so doing. The odd cumulants are zero when $\nu_1 = \nu_2$. We have

Even cumulants (r = 2 s, s > 0 ) 

= 2^, 1^2 = 2^ 
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(/j = 2f)+1,1^2 = 2g'+1 

= v^. Drop out the last bracket of terms in the above two cases. 

= 2p, = 2q+ 1 

/c, = (r - 1 )! [C(r) - i + •.. + 1 + 3 r + gr + ■ • • + ’ 

j/j = 2^ +1, 1^2 — 2q. Interchange and Vg in this last case. 

Odd mmulante (r = 25 4-1> 5 > ^) 

== 2p, Vg = 2 g, or = 2 ^ -h 1 , Vg = 2 g +1 


Vi = 2p, ^2 = 23 + 1 

AC, - (r - 1 ) ![^{r) (1 - 21 -’') + + i + ... - (l + ~ + ^, + ■ • • + (i;“ 2 y)_ ’ 

= 2p+1, V 2 = 2q. Interchange and Vg in this last case, and change the sign of /c,. 
In the special case of s = 1 , we have 
(Aj = 2 p, vf= 2q, or = 2 p+1, = 2 g+ 1 


Vy = 2p, V2 = 2g+1 


ACi = ilog - [— H-+ ... H-. 

\^iJ + 2 ^2 — 2 / 

= 2g+l 

^i = ilog(7j)+log2 + (i + i+...+^)-(l + i + ^+... + j;^).^ 


= 2j5 4-1, Vg = 2q, Interchange and in this last case and change the sign of 
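For large $\nu_1$ and $\nu_2$, a combination of the asymptotic formulae already given readily yields the following results:

\[ \kappa_1 \sim \frac12\left(\frac{1}{\nu_2} - \frac{1}{\nu_1}\right) + \sum_{j=1}^{\infty}\frac{(-4)^{j-1}B_j}{j}\left(\frac{1}{\nu_2^{2j}} - \frac{1}{\nu_1^{2j}}\right), \]

the numerical coefficients being as for the $\kappa_1$ of $\tfrac12\log(\chi^2/n)$,

\[ \kappa_s \sim \frac{(s-2)!}{2}\left(\frac{1}{\nu_2^{s-1}} + \frac{(-1)^s}{\nu_1^{s-1}}\right) + \frac{(s-1)!}{2}\left(\frac{1}{\nu_2^{s}} + \frac{(-1)^s}{\nu_1^{s}}\right) + 2\sum_{j=1}^{\infty}\frac{(-4)^{j-1}B_j\,(2j+s-2)!}{(2j)!}\left(\frac{1}{\nu_2^{2j+s-1}} + \frac{(-1)^s}{\nu_1^{2j+s-1}}\right). \]

We may put $s = 1$ in $\kappa_s$ provided we drop the first term. We note also that $\kappa_2$ and higher cumulants can be written down immediately by differentiating the terms in $\nu_1$ and $\nu_2$ of $\kappa_1 - \tfrac12\log(\nu_2/\nu_1)$ successively with respect to $\nu_1$ and $-\nu_2$ respectively.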

These are the results given by Cornish & Fisher (1937), whose formulae can be extended at sight by means of the results of this paper. A first approximation not only gives the familiar results

\[ \kappa_1 \sim \frac12\left(\frac{1}{\nu_2} - \frac{1}{\nu_1}\right), \qquad \kappa_2 \sim \frac12\left(\frac{1}{\nu_2} + \frac{1}{\nu_1}\right), \]

but also the more general

\[ \kappa_s \sim \frac{(s-2)!}{2}\left(\frac{1}{\nu_2^{s-1}} + \frac{(-1)^s}{\nu_1^{s-1}}\right), \]

but it should be noted that for all $s > 1$ a second approximation, which takes in an additional term, is

\[ \kappa_s \sim \frac{(s-2)!}{2}\left(\frac{1}{(\nu_2-1)^{s-1}} + \frac{(-1)^s}{(\nu_1-1)^{s-1}}\right). \]

The accuracy of the asymptotic approximation at the limits of the z table given by Fisher
& Yates (1943) can be seen by applying it to our example ($\nu_1 = 24$, $\nu_2 = 60$). The numbers
of terms which are significant in the eighth place (needed for final accuracy to 7 decimal
places), are three for Ki, four for and /Cg, and three for and Kq. The first term for 
$\kappa_6$, namely $12(\nu_1^{-5} + \nu_2^{-5})$, yields 0.0000 015, rather more than 20 % too low. To use $12\{(\nu_1 - 1)^{-5} + (\nu_2 - 1)^{-5}\}$ would give 0.0000 019, about 2 % too high.

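Should $\nu_1$ or $\nu_2$ be only moderate in size, the other being large, we may make use of the relation

\[ \zeta(s, a) = \frac{1}{a^s} + \frac{1}{(a+1)^s} + \dots + \frac{1}{(a+r-1)^s} + \zeta(s, a+r) \qquad (r \text{ an integer}), \]

where $a$ is one-half of the smaller of $\nu_1$ or $\nu_2$, to convert our formulae into forms in which asymptotic expansions may be applied to both of the Zeta-functions. We then have ($\nu_1 < \nu_2$):

\[ \kappa_s = (s-1)!\left[2^{-s}\{\zeta(s, \tfrac12\nu_2) + (-1)^s\zeta(s, \tfrac12\nu_1 + r)\} + (-1)^s\left\{\frac{1}{\nu_1^s} + \frac{1}{(\nu_1+2)^s} + \dots + \frac{1}{(\nu_1+2r-2)^s}\right\}\right] \]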


and 




_ _ I_^ { -iy-^B^{2j + s-2)\ 

! — 1 ) 271'" ^ (s — 1 )! 


(17) 


i + x~,+ , 


Particular cases of some interest arise (i) when $r = \tfrac12(\nu_2 - \nu_1)$, $\nu_1$ and $\nu_2$ being either both odd or both even, and (ii) when $r = \tfrac12(\nu_2 - \nu_1 + 1)$, $\nu_1$ (or $\nu_2$) being even and $\nu_2$ (or $\nu_1$) odd. In the former case the first term within square brackets in $\kappa_s$ is $2^{-s}(1 + (-1)^s)\,\zeta(s, \tfrac12\nu_2)$, which is zero when $s$ is odd and $2^{1-s}\zeta(s, \tfrac12\nu_2)$ when $s$ is even. In the latter we have

\[ 2^{-s}\{\zeta(s, \tfrac12\nu_2) + (-1)^s\zeta(s, \tfrac12\nu_2 + \tfrac12)\}, \]

which is $\zeta(s, \nu_2)$ when $s$ is even. With $s$ odd we are concerned with the difference of two Zeta-functions in which the $a$'s differ by one-half, and the expression may be written


and 


^ (-ly __ 1 /dv-^ 

i^niv^i+jY (s-l)!\dv2/ 

“ (— 1)’ r 1 dx 

y-ni' 2 +i~Jo l + '-K 


S 




(- 1 )^ 

Vi+3 


A 2 ’-+i(va-fj)! 


on integration by parts 


_ J_ I V 

on expansion in powers of This asymptotic expansion is an interesting one in which the 
early coefficients are very simple, for the series is 


11 1.1 17 
The various cases are set out below:

Even aonulants (r = 2s, s> 0) 

I'l = 2p, I'a = 2q, or I'l = 2/^+1, = 2q-\-l 

'c. = (r - ]) + • • • 


(r-2)! (r-1)! 1 ” (-4))g,.(2j + r--2 )! 

if-i vt ur.A 


I'a ^’23=1 


Vy = 2}),Vz=2q^-\ 


, (r-2)! , (r-1)! 1 ” (-l)^5,(2j + r-2)! _ 

+ vr^ 2ui (2j)!vr-i • 


OiA cumulants (r = 2s +1, s > 0) 

= 2p, 1^2 = 2q, or = 2p+l, = 2q+l 




1/, = 2p, 1^2 = ^+ I 


= — (r- l)!l-; + 7r“7-^+... + 


' (v, + 2 )-- ' - (3^2-2)^!’ 

1 


( 21 ) 


rj (r, + 2 )-- •• (132-1)' 


, (r-1)! , 1 ” (-l)M(2^>-l)3g/2j+r-2)! 

2vl ^ ' 


In the special case of s = 1, we have 

Mj = 2p,v^ = 2q, or = 2jp+l, = 2q+l 


''■i = i log I 


'1 1 

~ H—7-7: +. ♦. "i— 


•'i = 2p, Vi-2q + l 


pj l^'i 'T + 2 




1 1 


r-UbrrH ri ^ I I Ml ^ I ^(-l)'-^(2*'-l)5i I... 


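3. The logarithmic t distribution

When $\nu_1 = 1$, $z = \log|t|$, and we thus have as a special case for the distribution of $\log|t|$ for $\nu_2 = n$ degrees of freedom:

\[ 2\kappa_1 = \log n + \lim_{s\to 1}\{\zeta(s, \tfrac12 n) - \zeta(s, \tfrac12)\} \tag{24} \]

\[ \phantom{2\kappa_1} = \log n + \psi(\tfrac12) - \psi(\tfrac12 n) \qquad (\psi(\tfrac12) = -\gamma - 2\log 2). \tag{25} \]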

For small n 


11 

/C 2 =4]0g 33-log 2-(-+-+. 


n :=2p-\-l 


(26) 


For large n 



».~-i(r+iog2)4+.£'-‘''-‘S,, 

2n V-i 

(27) 
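Also

\[ 2^s\kappa_s = (s-1)!\,\{\zeta(s, \tfrac12 n) + (-1)^s\zeta(s, \tfrac12)\} \qquad (s > 1) \tag{28} \]

\[ \phantom{2^s\kappa_s} = (-1)^s\psi^{(s-1)}(\tfrac12 n) + \psi^{(s-1)}(\tfrac12) \qquad (\psi^{(s-1)}(\tfrac12) = (-1)^s(s-1)!\,(2^s - 1)\,\zeta(s)). \tag{29} \]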

For small n w^e have the following cases: 

Even cumulants (r = 25, ^ > 0 ) 

n=lt> '''-<’-‘)'{*:M-(r. + f+- + (S^27-))’ (30) 

. = 2„+l ,,= (,-l)!{2SM(l-2-0-(l + ^l + i + ^.,+^^)], 

Odd cumulants (r ==: 25 h- 1, 5 > 0) 


n = 2^, ^^ = -(r-l)!|^(r)(l-2i-) + (4 + ^+- + ^^,)}. 

W= 2 ^+l ~ + ^ + ^+••■ 


(31) 


{n-2r}- 


For large n 






( 5 > 1 ). 

(32) 


In the special case of $n = \infty$ we have for the distribution of $\log|x|$, where $x$ is a normal variable with zero mean and unit standard deviation:

\[ \kappa_1 = -\tfrac12(\gamma + \log 2), \qquad \kappa_s = (-1)^s(s-1)!\,(1 - 2^{-s})\,\zeta(s), \tag{33} \]

as follows also from the case of $\tfrac12\log(\chi^2/n)$ on putting $n = 1$.


4. Note on the distribution approximation

Fisher's result that $\sqrt{2\chi^2}$ is approximately normally distributed about a mean of $\sqrt{2n - 1}$ with unit variance ($n$ being the number of degrees of freedom) is well known. The demonstration depends on showing that the mean value of $\chi$

ACi - ^2r\{n-\- for large n 

and that the variance is — /cf ^ 

but to this order of approximation it is not possible to show that and yg tend to zero with 
increasing n, A formula for the ratio of the two Gamma functions, developed as far as terms 
in (see Wishart (1925)), gives yi^( 27 i)-'i and yg = 0{n-''^) (see, for example, Kendall 
(1945)), but owing to the vanishing of the term in of yg its leading term has so far not been 
accurately obtained, although the exact (but somewhat complicated) expressions for the 
/?! and /?2 of the distribution of 5 — ^xl^ji"^' + 1) wore given in an editorial in Biometrika (1915), 
10, 622. 

I 00 / _ 1 )i~i (2^^ ’—I) B 

Since mUn-^ D-fiM ^ .S ^- ' ^ 

by the formula given in § 2, we find on integration, and insertion of the appropriate constant, 
that 00 n^. 

log riin + 1) - log r(|«.) = i log ihn) + 2 j( 2 j-l)«, 2 )-i’ > 


and thus have 


ri{n+l) 

r{\n) 


V(in)exp-i 


ih ' r 


(34) 


Biometrika 34 


12 
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which can readily be expanded to give the additional terms necessary to enable the cumu- 
lants of (or of V(2 a;“)) be worked out (see Johnson & Welch (1939)). Taking 
as the first approximation we find 


r|(^+i) 

mn) 



exp 


1 


1 


16?r2(^ 8??,^ 


A 

4n® 


i)) 


4- f)(«.“^'®), 



(36) 


thus providing a second approximation 
one-half. The cumulants of V(2X^) 

^i = V(2n-l)( 


to the ratio of two Gamma functions differing by 


*^2 


i _ A -A 


0{n-\ 


The Editorial in Biometrika (1915), 10, 523 calls attention in a footnote to ‘Student’s’
approximations for the $\beta_1$ and $\beta_2$ of the sample standard deviation. The above formulae
show that ‘Student’s’ results should be



1 9 31 \ 

**" Sw®) 


+ 0{n-% 



J_ J__ 



in which $n$ is now the size of the sample. For $n = 10$ these give values too low by 2 and 5
respectively in the fourth place of decimals. Practically four-figure accuracy can be attained
with $n$ as low as 10 if in the terms in $n^{-3}$ we replace 31/8 by 17/4 and 7/8 by 1.


THE MEANING OF A SIGNIFICANCE LEVEL


By G. A. BARNARD


A level of significance is a probability. To say that a given result is significant on the 5 %
level means that some class of events has probability 0.05. Now whatever theory we may
hold as to the nature of probability, in order to give a statement of probability a precise
meaning we must refer to some reference class, or set of data, on which the probability is
calculated. What is the reference class involved in a level of significance?

To many people the answer to this question seems simple enough. The reference class 
involved is the set of indefinite (possibly imaginary) repetitions of the experiment which 
gave the result in question. Otherwise put, the data, on which the probability is calculated, 
are the external conditions of the experiment. The following example indicates, however, 
that the meaning of this reference class is not always clear. The example is a modified form 
of one given by Prof. R. A. Fisher in a letter to the author.

Suppose we have a bag of chrysanthemum seeds, known to give plants having white 
flowers or plants having purple flowers, no other colours being possible. We suspect that 
the proportions of white and purple seeds are equal, and to test this hypothesis we select 
at random ten seeds from the bag, and plant them. Nine of the plants grow to maturity, 
and all of them have white flowers. On what level of significance can we reject the hypothesis 
of equality of proportions ? We may assume that white and purple plants are equally viable. 

It would be natural to argue that, if white and purple flowers were equally likely, the
probability of our result would be $1/2^9$. If there is no reason to suspect an excess of white
rather than an excess of purple flowers, we must add to this the probability of getting nine
purple flowers, which is also $1/2^9$, giving a total probability of $1/2^8$. The hypothesis of equality
of proportions would then be rejected on the 1/256, or the 0.3906 %, level of significance.
But if we did this our reference class would not be the set of indefinite repetitions of the
experiment, in its ordinary meaning.
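The arithmetic of this first calculation can be checked exactly with rational numbers. The snippet is only an illustration of the two-sided sum for nine plants; the variable names are not from the paper.

```python
from fractions import Fraction

p_nine_white = Fraction(1, 2) ** 9     # all nine surviving plants white
p_nine_purple = Fraction(1, 2) ** 9    # all nine surviving plants purple
two_sided = p_nine_white + p_nine_purple

print(two_sided, float(two_sided) * 100)   # 1/256, i.e. about 0.3906 %
```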

A repetition of the experiment, in its ordinary meaning, would consist of another selection 
of ten seeds from the bag, and their planting and growth. On such another occasion all ten 
plants might grow to maturity, or all or some might die. These possibilities have not been 
taken into account in our calculation of probability, so far. 

To allow for the possible variation in the number of plants which grow, we might lay out
the set of all possible results of the experiment as in Fig. 1, where n denotes the number
of plants that grow, and r denotes the excess of white over purple. Thus any point in the figure
can be referred to uniquely by its co-ordinates (n, r). If we now introduce a parameter p,
to denote the probability (if it exists) that a plant will grow to maturity, given that it has
been selected, the probability associated with the point (n, r) on the hypothesis of equality
of proportions of white and purple will be

\[ W(n, r \mid p) = \frac{10!}{n!\,(10-n)!}\,p^n(1-p)^{10-n}\,\frac{n!}{\{\tfrac12(n+r)\}!\,\{\tfrac12(n-r)\}!}\,2^{-n}, \]


and since this is a function of the unknown p, we have a special problem of arranging the
points (n, r) in order of significance before we can establish a test. The situation in this
respect is similar to that dealt with in the paper on 2 × 2 tables, printed earlier in this issue
(Barnard, 1946, pp. 123-38 above).

Proceeding as in the earlier paper, we notice first that the same level of significance must
apply to (n, r) as to (n, −r), so that we can confine our further considerations to the upper
half of the diagram. Now in this half, the transition from (n, r) to (n + 1, r + 1) means we
discover that one of the plants which failed to grow in our case was in fact a white-flowered
plant. In this case our conviction that there is an excess of white-flowered plants would be
strengthened, so that (n + 1, r + 1) would be reckoned more significant than (n, r). Similarly,
going from (n, r) to (n + 1, r − 1) would mean that a missing plant was found to be purple,
and this would weaken our belief in an excess of white-flowered plants; consequently,
(n, r) would be reckoned more significant than (n + 1, r − 1). Finally, going from (n, r) to
(n + 2, r) would mean growing two more plants, one purple and one white, and this would
increase our tendency to believe in the equality of proportions. Consequently, (n, r) would
be reckoned more significant than (n + 2, r). These principles taken together imply that
points lying north-east, or west, of a given point (n, r), or between these two directions,
would be reckoned more significant than (n, r); while, conversely, points lying east to
south-west (inclusive) from (n, r) would be reckoned less significant than (n, r). The relative
significance of points lying inside the half-quadrants north-east to east and south-west to
west would remain undetermined.

We could now proceed as in the paper (1), building up a test, consistent with the above
partial ordering, in such a way as to make the significance or otherwise of our result depend
as little as possible on any knowledge we may have about the value of p. But we need not
carry this through for the result we have quoted, since our conditions by themselves require
that the only points in the diagram which should be reckoned not less significant than our
result are the points (9, 9), (9, −9), (10, 10) and (10, −10). The probability associated with
these four points is

\[ P(9, 9; p) = 2\{10p^9(1-p)\,2^{-9} + p^{10}\,2^{-10}\} = (p/2)^9(20 - 19p), \]

the maximum value of which occurs when p = 18/19, and is $P_m(9, 9)$ = 0.002413. Thus on
this basis we should conclude that our result was significant on the 0.2413 % level.

The difference between the first result, 0.3906 %, and the second, 0.2413 %, is in practice
negligible. Somewhat larger differences will be found in other similar cases, however, and
it seems worth while to try to clarify the cause of the discrepancy.
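The second calculation can be sketched numerically as follows. The code assumes the binomial model described for Fig. 1 (each of the ten seeds grows independently with probability p, and each grown plant is white or purple with probability one-half); the function names and the grid search are illustrative choices, not the author's procedure.

```python
from math import comb

N_SEEDS = 10

def w(n, r, p):
    """Probability of the point (n, r): n of the 10 seeds grow, r = whites minus purples."""
    if abs(r) > n or (n + r) % 2:
        return 0.0
    grow = comb(N_SEEDS, n) * p**n * (1 - p) ** (N_SEEDS - n)
    colours = comb(n, (n + r) // 2) * 0.5**n
    return grow * colours

def tail_probability(p):
    """Total probability of the four points (9, +-9) and (10, +-10)."""
    return sum(w(n, r, p) for n, r in [(9, 9), (9, -9), (10, 10), (10, -10)])

# Crude grid search for the value of p that maximises the tail probability.
grid = [i / 100000 for i in range(100001)]
p_best = max(grid, key=tail_probability)
print(p_best, tail_probability(p_best))
```

The search settles close to p = 18/19, where the tail probability is roughly 0.0024, the same order as the level obtained from nine plants alone.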

Consider three possible causes for the failure of the tenth plant to grow to maturity: 

(1) The bag from which the seed was taken is known to contain a proportion of dead seeds,
which are physically indistinguishable from the live ones, and the tenth seed planted
happened to be one of these. The conditions of growth were such that any live seed planted
would have grown.

(2) The tenth plant happened to be attacked by a soil pest, which destroyed it. 

(3) The statistician trod on the tenth plant while running for a bus; otherwise, it would 
have grown. 

If we now consider what would happen in these three cases if the experiment were
repeated, in case (1) we should be just as uncertain as before how many plants would grow, 
out of those selected. In case (2), we might or might not happen to strike a good year for 
the pest in question, so that we might or might not have a similar accident recurring. In 
case (3) we should obviously give the statistician firm instructions not to be careless, and 
then we could be reasonably certain that all the plants selected would grow.* 

In the first case, we can suppose that the proportions of white, purple, and dead seeds in
the bag are, respectively, $p_1$, $p_2$ and $1 - p_1 - p_2$, and the purpose of our experiment is to
test the hypothesis $p_1 = p_2$. In this case, putting $p_1 + p_2 = p$, we can clearly apply the
analysis of Fig. 1, and the appropriate level of significance is 0.2413 %.

In the third case, the situation actually realized is just what it would have been if we had
warned the statistician beforehand, and then thrown one of the ten seeds back into the bag.
Thus our effective sample size here is 9, and the appropriate level of significance is 0.3906 %.

In the second case, the answer depends on our attitude to the set of accidents of which
the pest is a specimen. If this set of accidents is regarded as a stable set of chance causes
we may be justified in representing its effect on the growth of our plants by the probability p.
If, on the other hand, the incidence of such pests undergoes, say, regular cyclical
fluctuations from year to year, so that its incidence is to some extent predictable, if not
wholly controllable, then we should not be justified in assuming the existence of a real
probability corresponding to our parameter p. We should, to be on the safe side, in this case
allow for the possibility that experimental technique might improve in the future, to such
an extent as to eliminate the possibility of such accidents. Thus, adopting this conservative
attitude to our results, we should here treat the effective sample size as 9. The repetitions
of the experiment which we have in mind would then be imaginary repetitions, in which
experimental technique was supposed to be better than it is now, and we have as much
control over pests as we have over statisticians.

The general situation illustrated by this example can be described in terms of the notion
of ‘isolate’ introduced by Prof. H. Levy (1931). In making an experiment, we try to
construct an isolate—a system, or part of the world, which we suppose has relatively little
interaction with the rest of the world, and which, for practical purposes, may be considered
on its own. This isolate may contain within itself all the systems of chance causes which are
regarded as affecting, to any practical extent, the results of the experiment. Such is the case
in (1), where all the chance causes involved in the experiment are supposed given in
the bag which is the subject of the experiment. Here, then, we are dealing with a ‘good
isolate’, whose interaction with the rest of the world is really negligible, and chance causes
operate within the isolate.

* It is not suggested that the three cases exhaust the multiplicity of types which might arise in
practice. As Prof. Pearson has pointed out, if it were not the statistician, but his three-year-old son
who was the vandal in case (3), we should have here a situation intermediate between our second and
third instances.

In case (3), on the other hand, we are dealing with an imperfect isolate. The outside world,
in the shape of the statistician, interacts with our isolate to an extent not negligible in
practice. Fortunately, in this case we are able to construct a smaller isolate, consisting of
the nine surviving plants, in which the interactions with the outside world are negligible.
In case (2), there may be some doubt as to what isolate we are discussing. If we regard soil
pests and such things as included in the isolate, and represent them as a stable set of chance
causes, then we are entitled to analyse as in case (1); but if the pests are not included in the
isolate, we should analyse as in case (3).

Statistical tests are applicable to at least two types of experiment. First, to experiments 
in which the isolate studied contains within itself a system of chance causes which may 
influence the results. And second, to experiments in which the isolate studied is not a ‘ good’ 
isolate, and the residual interactions with the rest of the world may affect the results. There 
may also be mixed cases. 

The distinction between the two types may also be brought out in relation to the necessity
or otherwise of an ‘artificial’ randomization procedure, using random digits or the like.
In the first type, such an artificial randomization procedure is not strictly necessary; for
example, with our bag of seeds, the bag itself, and its physically indistinguishable contents,
forms a perfectly adequate randomizer. We have in this case, as it were, an impermeable
shield around the system, which prevents any external shocks from affecting the system.
In the second type of experiment, we need to ensure that the interactions with the outside
world will not mask the results we are interested in; and if we cannot ensure a practically
complete separation from the outside world, then the effect of external interactions must
be randomized, by a special procedure. The randomization here acts like a shock absorber,
specially placed around the experiment to distribute external shocks evenly through the
system.

In the first type of experiment, the reference class to which the significance level applies 
is in fact the set of indefinite repetitions of the experiment in question. In the second type 
of experiment, the reference class is an ideal set, in which the accidental influences of the 
outside world repeat themselves exactly, while the effect of these accidents on the system 
varies as a result of the special randomization.


ON THE DISTRIBUTION OF THE RANK CORRELATION
COEFFICIENT τ WHEN THE VARIATES ARE
NOT INDEPENDENT

By WASSILY HOFFDING


I. Introduction

1. Consider a population distributed according to two variates x, y. Two members
$(x_1, y_1)$ and $(x_2, y_2)$ of the population will be called concordant if both values of one member
are greater than the corresponding values of the other one, that is if

\[ x_1 < x_2,\ y_1 < y_2 \quad\text{or}\quad x_1 > x_2,\ y_1 > y_2. \]

They will be called discordant if for one member one value is greater and the other one smaller
than for the other member, that is if

\[ x_1 < x_2,\ y_1 > y_2 \quad\text{or}\quad x_1 > x_2,\ y_1 < y_2. \]

The probability p that two members drawn from the population at random without
replacement are concordant will be called the probability of concordance, the probability
q that they are discordant will be called the probability of discordance.

In the following only populations will be considered for which the probabilities of
$x_1 = x_2$ and $y_1 = y_2$ are zero, so that

\[ p + q = 1. \tag{1} \]


The main types of such populations are (a) an infinite population with both x and y
distributed continuously, (b) a finite population where all values of x and all values of y are
different among themselves. The condition that the two members are drawn without
replacement is, of course, only relevant in case (b).

For a sample of n members drawn from the population, the probabilities of concordance
and discordance are defined in the same manner as for the population. They will be denoted
by p' and q' to distinguish them from the population values. If for the population (1) is
fulfilled, it may be assumed that all values of x and all values of y in the sample are different,
so that

\[ p' + q' = 1. \tag{2} \]


It follows from the definition that p' is the relative frequency of concordant pairs among 
the pairs which can be formed from the members of the sample. 


The probability of concordance expresses an essential property of a bivariate distribution.
It may in itself be considered as a measure of correlation. p' is an estimate of p; it will be
shown that the mean value of p' is p. If a coefficient lying between the limits −1 and +1 is
preferred, the quantity

\[ \tau = p' - q' = 2p' - 1 \tag{3} \]

may be taken.
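The definitions of p', q' and τ translate directly into a count over pairs. The sketch below assumes a sample with no tied x or y values, as stipulated by (1); the function name and the data are made up for illustration.

```python
from itertools import combinations

def concordance(sample):
    """Return (p', q', tau) for a list of (x, y) pairs with no tied x or y values."""
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(sample, 2):
        if (x1 - x2) * (y1 - y2) > 0:
            concordant += 1
        else:
            discordant += 1
    pairs = concordant + discordant
    p_prime = concordant / pairs
    q_prime = discordant / pairs
    return p_prime, q_prime, p_prime - q_prime

sample = [(1, 2), (2, 1), (3, 4), (4, 3), (5, 5)]   # made-up data
p_prime, q_prime, tau = concordance(sample)
print(p_prime, q_prime, tau)
```

For this sample eight of the ten pairs are concordant, so p' = 0.8, q' = 0.2 and τ is 0.6 up to floating-point rounding.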


2. The quantity p here termed the probability of concordance was apparently first
considered by Esscher (1924), who also used the quantity

\[ D = \frac{1}{n(n-1)}\sum_{i=1}^{n}\sum_{j=1}^{n}\mathrm{sign}(x_i - x_j)\,\mathrm{sign}(y_i - y_j) \]

(where $x_i$, $y_i$, $i = 1, \dots, n$, are the sample values of the variates), which is the same as the
coefficient τ as defined by (3). Esscher showed that if x and y are normally correlated with
correlation coefficient r, the expectation of D = τ is

\[ E(\tau) = \frac{2}{\pi}\sin^{-1} r. \tag{4} \]

Hence, from this equation, he suggested estimating r from ranked data by means of the relation

\[ r = \sin\frac{\pi}{2}\tau = \sin\pi(p' - \tfrac12). \]
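Esscher's double sum and the conversion from τ to r can be sketched as below. The code assumes the double-sum form of D over all ordered pairs and the relation r = sin(πτ/2) for a normal population; the helper names and data are hypothetical.

```python
import math

def sign(v):
    """Return -1, 0 or 1 according to the sign of v."""
    return (v > 0) - (v < 0)

def esscher_d(xs, ys):
    """D = (1 / (n(n-1))) * sum_i sum_j sign(x_i - x_j) sign(y_i - y_j)."""
    n = len(xs)
    total = sum(sign(xs[i] - xs[j]) * sign(ys[i] - ys[j])
                for i in range(n) for j in range(n))
    return total / (n * (n - 1))

def r_from_tau(tau):
    """Invert E(tau) = (2/pi) arcsin(r), i.e. r = sin(pi * tau / 2)."""
    return math.sin(math.pi * tau / 2)

xs = [1, 2, 3, 4, 5]   # made-up data
ys = [2, 1, 4, 3, 5]
d = esscher_d(xs, ys)
print(d, r_from_tau(d))
```

Each unordered pair is counted twice in the double sum, which is why the divisor is n(n − 1) rather than the number of pairs.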


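For the variance of τ Esscher found in the case of a normally distributed population

\[ \mathrm{var}(\tau) = \frac{2}{n(n-1)}\left[4pq + 2(n-2)\left\{\frac19 - \left(\frac{2}{\pi}\sin^{-1}\frac{r}{2}\right)^2\right\}\right], \tag{5} \]

where

\[ 4pq = 1 - \left(\frac{2}{\pi}\sin^{-1} r\right)^2. \]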


While Esscher saw in p' and D = τ only a means for estimating r, Lindeberg (1926) stressed
the significance of the probability of concordance itself for judging the degree of dependence
between the variates. He proposed for that purpose the coefficient

\[ P = 100p' - 50 = 50\tau, \]

called by him Korrelationsprozent. Lindeberg also gave, without proof, a formula for the
variance of p' in the general case of correlated variates (see (13) below).

Jordan (1927) suggested using, instead of Lindeberg’s P, the coefficient later termed by
Kendall τ.

Kendall (1938), independently of the above authors, proposed τ as a measure of rank
correlation. He completely solved the problem of the sampling distribution of τ in a universe
in which all possible rankings are equally probable, showing that it rapidly tends to
normality for increasing n.

3. The main object of this paper is to show that the sampling distribution of p' (and hence
that of τ) tends to normality as n → ∞ for any population with continuously distributed x
and y if a certain condition is fulfilled (Part IV). In addition, Lindeberg’s formula for
the variance of p' is proved (Part II) and extended for a finite population (Part V). Finally,
in Part VI the problem of estimating var(p') from the sample is considered.

II. Mean value and variance of p' in the case of an infinite population

4. Consider a sample of n drawn at random from an infinite population with continuous
x and y. Replace the values of x and of y in the sample by their ranks and arrange the
members of the sample so that the ranking of x is 1, 2, ..., n. Then the ranking of y is a
permutation

\[ \Pi = (\pi_1, \dots, \pi_n) \]

of the numbers 1, ..., n.

Let I and J be the numbers of inversions in the permutations $(\pi_n, \pi_{n-1}, \dots, \pi_1)$ and
$(\pi_1, \dots, \pi_n)$. Then

\[ p' = \frac{2I}{n(n-1)}, \qquad q' = \frac{2J}{n(n-1)}. \]

Thus the knowledge of the permutation Π corresponding to the given sample is sufficient
for evaluating p'.
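The inversion counts I and J can be computed directly from the permutation of y-ranks. The following is a minimal sketch with a quadratic-time count; the function names and the example permutation are illustrative only.

```python
def inversions(perm):
    """Number of pairs (i, j), i < j, with perm[i] > perm[j]."""
    n = len(perm)
    return sum(1 for i in range(n) for j in range(i + 1, n) if perm[i] > perm[j])

def p_q_from_permutation(perm):
    """Return (p', q') from the y-ranks listed in order of increasing x-rank."""
    n = len(perm)
    big_i = inversions(perm[::-1])   # inversions of the reversed permutation
    big_j = inversions(perm)         # inversions of the permutation itself
    pairs = n * (n - 1)
    return 2 * big_i / pairs, 2 * big_j / pairs

perm = [2, 1, 4, 3, 5]   # made-up ranking
p_prime, q_prime = p_q_from_permutation(perm)
print(p_prime, q_prime)
```

Here J = 2 and I = 8, giving p' = 0.8 and q' = 0.2, and the two always sum to one because every pair is an inversion of exactly one of the two permutations.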
5. Let P(Π) be the probability of drawing a random sample represented by the
permutation Π. Let p'(Π) be the probability of concordance for such a sample. Then

\[ p = \sum P(\Pi)\,p'(\Pi), \tag{7} \]

where the sum is extended over all permutations Π of n numbers.

The right-hand side of (7) is equal to the mean value of p'. Hence

\[ E\,p' = p. \tag{8} \]

Consider, in generalization of p', the probability w' that among m ≤ n members drawn
from the sample at random without replacement, certain pairs of members are concordant;
for instance, among four members A, B, C, D, the pairs AB, AC, AD; or the pairs AB, CD,
etc. Let w be the corresponding probability for the parent population. Then it is seen in
the same manner as with p' that

\[ E\,w' = w. \tag{9} \]

Thus, if we can express the probability $(p')^{\mu}$ of drawing μ concordant pairs from the
sample, replacing each pair after drawing it, by probabilities without replacement of the
type w', we can also, in virtue of (9), represent $E(p')^{\mu}$ by population parameters of the type w.

6. Now, the probability of drawing from the sample one concordant pair and, after
replacing it, of drawing again a concordant pair, is the sum of the following three probabilities:

(a) the probability of getting the same pair in both drawings, $1\big/\binom{n}{2}$, multiplied by
the probability that this pair is concordant (p');

(b) the probability that the second pair has one member in common with the first pair,
$2(n-2)\big/\binom{n}{2}$, multiplied by the probability, say k', that among three members A, B, C
drawn from the sample without replacement, one, say A, is concordant with the other two;

(c) the probability that the second pair has no member in common with the first one,
$\binom{n-2}{2}\big/\binom{n}{2}$, multiplied by the probability that among four members A, B, C, D drawn
without replacement, two pairs without a member in common, say AB and CD, are
concordant. The latter probability may be denoted by $(p^2)'$ since the corresponding probability
for the infinite population is $p^2$.


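Thus,

\[ \binom{n}{2}(p')^2 = p' + 2(n-2)k' + \binom{n-2}{2}(p^2)', \tag{10} \]

and, applying (9),

\[ \binom{n}{2}E(p')^2 = p + 2(n-2)k + \binom{n-2}{2}p^2. \tag{11} \]

Hence, we have for the variance of p'

\[ \binom{n}{2}\mathrm{var}(p') = p + 2(n-2)k - (2n-3)p^2, \tag{12} \]

or

\[ \mathrm{var}(p') = \frac{2}{n(n-1)}\{pq + 2(n-2)(k - p^2)\}. \tag{13} \]

This is identical with the formula given without proof by Lindeberg (1926).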

7. In the case considered by Kendall where all permutations Π of n numbers are equally
probable, the permutations of m < n numbers also are equiprobable. Hence

\[ p = P(1, 2) = q = P(2, 1) = \tfrac12. \]

Further, representing k as the mean value of k' in a sample of 3, we find

\[ k = P(123) + \tfrac13 P(132) + \tfrac13 P(213) = \tfrac16 + \tfrac1{18} + \tfrac1{18} = \tfrac5{18}. \]

Inserting these values in (13), we have

\[ \mathrm{var}(p') = \frac{2n+5}{18n(n-1)}, \qquad \mathrm{var}(\tau) = 4\,\mathrm{var}(p') = \frac{2(2n+5)}{9n(n-1)}, \]

in accordance with Kendall's formula.
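For small n the equiprobable case can be checked by brute force. The sketch below enumerates all n! rankings with exact fractions and compares the variance of τ with 2(2n + 5)/(9n(n − 1)); it also recomputes k for samples of three. The function names are illustrative, and the enumeration is only feasible for small n.

```python
from fractions import Fraction
from itertools import combinations, permutations

def tau(perm):
    """Kendall's tau for y-ranks `perm` listed in order of increasing x-rank."""
    n = len(perm)
    score = sum(1 if perm[j] > perm[i] else -1 for i, j in combinations(range(n), 2))
    return Fraction(2 * score, n * (n - 1))

def tau_variance(n):
    """Exact variance of tau when all n! rankings are equally probable."""
    values = [tau(perm) for perm in permutations(range(1, n + 1))]
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values) / len(values)

def k_equiprobable():
    """Chance that a random member of a random 3-sample is concordant with both others."""
    hits = total = 0
    for perm in permutations(range(3)):
        for a in range(3):
            others = [b for b in range(3) if b != a]
            hits += all((a - b) * (perm[a] - perm[b]) > 0 for b in others)
            total += 1
    return Fraction(hits, total)

for n in range(3, 7):
    print(n, tau_variance(n), Fraction(2 * (2 * n + 5), 9 * n * (n - 1)))
print(k_equiprobable())
```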
9n(7i-l) 


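III. Some algebraic formulae

8. We shall now consider some algebraic relations to be used in the proof of normality
of p' for large n.

Let $f_\delta(\rho)$ be a polynomial of degree δ in ρ. Then

\[ \sum_{\rho=0}^{\beta}(-1)^{\beta-\rho}\binom{\beta}{\rho}f_\delta(\rho) = \begin{cases} 0 & \text{if } \delta < \beta, \\ \beta!\,a_\delta & \text{if } \delta = \beta, \end{cases} \tag{14} \]

where $a_\delta$ is the coefficient of the highest power in $f_\delta(\rho)$.

To prove (14) write

\[ f_\delta(\rho) = a_\delta\,\rho^{[\delta]} + \dots, \]

where $\rho^{[0]} = 1$, $\rho^{[\delta]} = \rho(\rho-1)\dots(\rho-\delta+1)$ $(\delta \ge 1)$.

Then (14) follows from the fact that

\[ \sum_{\rho=0}^{\beta}(-1)^{\beta-\rho}\binom{\beta}{\rho}\rho^{[\delta]} = \beta^{[\delta]}\sum_{\rho-\delta=0}^{\beta-\delta}(-1)^{\beta-\rho}\binom{\beta-\delta}{\rho-\delta} \]

is equal to $\beta^{[\delta]}(1-1)^{\beta-\delta} = 0$ if $\beta - \delta > 0$ and to $\beta!$ if $\beta = \delta$.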
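The alternating-sum relation (14) is easy to test on a concrete polynomial. The sketch below uses integer arithmetic throughout; the coefficient list and the function name are arbitrary illustrations.

```python
from math import comb, factorial

def alternating_sum(coeffs, beta):
    """Sum over rho = 0..beta of (-1)^(beta-rho) * C(beta, rho) * f(rho).

    `coeffs` lists the coefficients of f from the constant term upward.
    """
    def f(rho):
        return sum(c * rho**power for power, c in enumerate(coeffs))
    return sum((-1) ** (beta - rho) * comb(beta, rho) * f(rho)
               for rho in range(beta + 1))

coeffs = [7, -3, 0, 5]                # f(rho) = 7 - 3 rho + 5 rho^3 (arbitrary example)
print(alternating_sum(coeffs, 3))     # degree equals beta: gives 3! * 5 = 30
print(alternating_sum(coeffs, 4))     # degree below beta: gives 0
```

The leading coefficient is the same whether the polynomial is written in ordinary or in factorial powers, which is why it can be read straight from the coefficient list.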

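9. For any non-negative integer ν we may write

$$n^{\nu} = d^{(\alpha)}_{\nu,0}(n-\alpha)^{[\nu]} + d^{(\alpha)}_{\nu,1}(n-\alpha)^{[\nu-1]} + \dots + d^{(\alpha)}_{\nu,\nu-1}(n-\alpha) + d^{(\alpha)}_{\nu,\nu}. \tag{15}$$

We will study certain properties of the coefficients $d^{(\alpha)}_{\nu\kappa}$.
From (15) it is seen immediately that

$$d^{(\alpha)}_{\nu,0} = 1. \tag{16}$$

Inserting in (15) n = α + β (β = 0, 1, ...) we have

$$(\alpha+\beta)^{\nu} = d^{(\alpha)}_{\nu,\nu} + \beta\,d^{(\alpha)}_{\nu,\nu-1} + \dots + \beta^{[\beta]}d^{(\alpha)}_{\nu,\nu-\beta}, \tag{17}$$

or

$$d^{(\alpha)}_{\nu,\nu} = \alpha^{\nu}, \qquad d^{(\alpha)}_{\nu,\nu-\beta} = \frac{1}{\beta!}\Big\{(\alpha+\beta)^{\nu} - \sum_{\lambda=0}^{\beta-1}\beta^{[\lambda]}d^{(\alpha)}_{\nu,\nu-\lambda}\Big\} \quad (\beta = 1, 2, \dots). \tag{18}$$

Hence we find by induction

$$d^{(\alpha)}_{\nu,\nu-\beta} = \frac{1}{\beta!}\sum_{p=0}^{\beta}(-1)^{\beta-p}\binom{\beta}{p}(\alpha+p)^{\nu}. \tag{19}$$

If we take this as definition of $d^{(\alpha)}_{\nu\kappa}$ for κ < 0, we have in virtue of (14)

$$d^{(\alpha)}_{\nu\kappa} = 0 \quad \text{for } \kappa<0. \tag{20}$$

Expanding $(\alpha+p)^{\nu}$ we have from (19)

$$d^{(\alpha)}_{\nu,\nu-\beta} = \frac{1}{\beta!}\sum_{p=0}^{\beta}(-1)^{\beta-p}\binom{\beta}{p}\sum_{\sigma=0}^{\nu}\binom{\nu}{\sigma}\alpha^{\sigma}p^{\nu-\sigma} = \sum_{\sigma=0}^{\nu}\binom{\nu}{\sigma}\alpha^{\sigma}d^{(0)}_{\nu-\sigma,\nu-\sigma-\beta},$$

or, putting ν − β = κ and noting that, by (20), $d^{(0)}_{\nu-\sigma,\kappa-\sigma} = 0$ for σ > κ,

$$d^{(\alpha)}_{\nu\kappa} = \sum_{\sigma=0}^{\kappa}\binom{\nu}{\sigma}\alpha^{\sigma}d^{(0)}_{\nu-\sigma,\kappa-\sigma}. \tag{21}$$

We have the recurrence relation

$$d^{(\alpha)}_{\nu+1,\kappa} - d^{(\alpha)}_{\nu\kappa} = (\alpha+\nu+1-\kappa)\,d^{(\alpha)}_{\nu,\kappa-1}, \tag{22}$$

which can be obtained by multiplying (15) by n, then writing down (15) with ν + 1 instead
of ν and comparing coefficients in both expressions.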
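The coefficients of this paragraph can be tabulated directly. The sketch below assumes the reading of the formulae used here: the coefficient $d^{(\alpha)}_{\nu\kappa}$ is computed from the explicit alternating sum (19) with β = ν − κ, and is then checked against the factorial-power expansion (15) and the recurrence (22). The helper names `falling` and `d` are illustrative.

```python
from math import comb, factorial


def falling(x, m):
    """Factorial power x^[m] = x(x-1)...(x-m+1), with x^[0] = 1."""
    result = 1
    for i in range(m):
        result *= x - i
    return result


def d(alpha, nu, kappa):
    """Coefficient d^(alpha)_{nu,kappa} via the explicit sum (19), beta = nu - kappa."""
    beta = nu - kappa
    if beta < 0:
        return 0  # no such term occurs in the expansion (15)
    total = sum((-1) ** (beta - p) * comb(beta, p) * (alpha + p) ** nu
                for p in range(beta + 1))
    return total // factorial(beta)  # the division is exact for integer alpha
```

For α = 0 these coefficients are the Stirling numbers of the second kind, S(ν, ν − κ); for example `d(0, 4, 2)` is the number of ways of splitting four objects into two non-empty groups.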

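10. We prove now two properties of the coefficients $d^{(\alpha)}_{\nu\kappa}$.

(I) $d^{(0)}_{\nu\kappa}$ is a polynomial in ν of degree 2κ, the term of highest degree being $\nu^{2\kappa}/(2^{\kappa}\kappa!)$.

In virtue of (16) this is true for κ = 0. And if it is true for κ − 1, the highest term of $d^{(0)}_{\nu+1,\kappa} - d^{(0)}_{\nu\kappa}$
is, by (22) with α = 0, $\nu^{2\kappa-1}/(2^{\kappa-1}(\kappa-1)!)$, and hence that of $d^{(0)}_{\nu\kappa}$, by a well-known theorem,

$$\frac{\nu^{2\kappa}}{2\kappa\cdot 2^{\kappa-1}(\kappa-1)!} = \frac{\nu^{2\kappa}}{2^{\kappa}\kappa!}.$$

(II) $d^{(\gamma-t)}_{t-\rho,\kappa}$ is a polynomial in t of degree 2κ with the highest term $(-1)^{\kappa}t^{2\kappa}/(2^{\kappa}\kappa!)$.

From (21),

$$d^{(\gamma-t)}_{t-\rho,\kappa} = \sum_{\sigma=0}^{\kappa}\binom{t-\rho}{\sigma}(\gamma-t)^{\sigma}d^{(0)}_{t-\rho-\sigma,\kappa-\sigma}.$$

In $\binom{t-\rho}{\sigma}$ the highest term in t is $t^{\sigma}/\sigma!$.

In $d^{(0)}_{t-\rho-\sigma,\kappa-\sigma}$ the highest term in t is $t^{2(\kappa-\sigma)}/(2^{\kappa-\sigma}(\kappa-\sigma)!)$.

In $(\gamma-t)^{\sigma}$ the highest term in t is $(-1)^{\sigma}t^{\sigma}$.
Hence, in $d^{(\gamma-t)}_{t-\rho,\kappa}$ the highest term is

$$\sum_{\sigma=0}^{\kappa}\frac{(-1)^{\sigma}t^{2\kappa}}{\sigma!\,2^{\kappa-\sigma}(\kappa-\sigma)!} = \frac{t^{2\kappa}}{2^{\kappa}\kappa!}\sum_{\sigma=0}^{\kappa}\binom{\kappa}{\sigma}(-2)^{\sigma} = \frac{(-1)^{\kappa}t^{2\kappa}}{2^{\kappa}\kappa!}.$$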

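11. The coefficients $d^{(0)}_{\nu\kappa}$ have a combinatorial meaning.

Let

$$S_{\beta}(\nu) = \sum\frac{\nu!}{\nu_1!\cdots\nu_{\beta}!}, \qquad S'_{\beta}(\nu) = \sum{}'\frac{\nu!}{\nu_1!\cdots\nu_{\beta}!},$$

where $\sum$ indicates summation over all $\nu_i \ge 0$, $\sum'$ over all $\nu_i \ge 1$, and in both cases $\nu_1+\dots+\nu_{\beta} = \nu$.
$S_{\beta}(\nu)$ is the number of ways of allocating ν objects on β places, and $S'_{\beta}(\nu)$ is the number of
ways of allocating ν objects on β places in such a manner that no place remains empty.

We have $S_{\beta}(\nu) = \beta^{\nu}$,

and a little consideration shows that

$$S_{\beta}(\nu) = \binom{\beta}{1}S'_1(\nu) + \binom{\beta}{2}S'_2(\nu) + \dots + \binom{\beta}{\beta}S'_{\beta}(\nu).$$

Comparing this with (17) we see that

$$\frac{1}{\beta!}S'_{\beta}(\nu) = d^{(0)}_{\nu,\nu-\beta}. \tag{23}$$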
IV. Proof of normality of p' for n → ∞

12. Any set of different pairs of elements belonging to the population will be briefly referred 
to as a system (two pairs being different if they have no more than one element in common). 

If we represent the elements of a system by points in a plane and the pairs of elements by
lines joining the points, we have a pattern corresponding to the given system. Two systems
will be said to have the same pattern if there exists a one-to-one correspondence between the
elements of both systems such that if two elements of one system form a pair, the two
corresponding elements of the other system also form a pair. Thus the only thing relevant in a
pattern is the lines connecting the points, the position of the points having no significance.

A pattern will be called simple if one can pass from any point of the pattern to any other
one along lines belonging to the pattern. A composite pattern is a pattern consisting of more
than one simple pattern.

If the elements of a system (or the points of the corresponding pattern) are denoted
by different letters A, B, C, ..., each pair of the system can be represented by a pair of
letters. All systems of one pair have the same pattern (AB). There are two patterns
of two pairs, one simple and containing three points (AB, BC) and one composite and
containing four points (AB, CD). There are five patterns of three pairs, three simple
(AB, BC, CA; AB, BC, CD; AB, AC, AD), one consisting of two different simple patterns
(AB, CD, DE) and one consisting of three equal simple patterns (AB, CD, EF).
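The counts of patterns quoted above (one, two and five patterns for one, two and three pairs) can be reproduced by enumeration. The sketch below is an illustration only: it treats a system as a set of unordered pairs of integers, reduces each system to a canonical relabelling so that systems with the same pattern coincide, and tests whether a pattern is simple by following its lines. All names are assumptions.

```python
from itertools import combinations, permutations


def canonical(pairs):
    """Smallest relabelled pair list; equal for systems with the same pattern."""
    points = sorted({x for pair in pairs for x in pair})
    forms = []
    for image in permutations(range(len(points))):
        relabel = dict(zip(points, image))
        forms.append(tuple(sorted(tuple(sorted((relabel[a], relabel[b])))
                                  for a, b in pairs)))
    return min(forms)


def patterns(num_pairs):
    """All distinct patterns formed by `num_pairs` different pairs."""
    elements = range(2 * num_pairs)  # enough points for fully disjoint pairs
    candidates = combinations(combinations(elements, 2), num_pairs)
    return {canonical(system) for system in candidates}


def is_simple(pattern):
    """True if every point can be reached from every other along the lines."""
    reached = set(pattern[0])
    changed = True
    while changed:
        changed = False
        for a, b in pattern:
            if (a in reached) != (b in reached):
                reached |= {a, b}
                changed = True
    return all(a in reached for a, b in pattern)
```

The brute-force canonical form tries every relabelling, so it is suitable only for the very small systems discussed in the text.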

13. If a simple pattern consists of a points and b pairs,

$$a \le b + 1. \tag{24}$$

For this is true for b = 1, and by adding one pair to a simple pattern, at most one point
is added if the new pattern is to be simple again.

Denote the different simple patterns by $S_1, S_2, \dots$, where $S_1$ stands for the one-pair
pattern and $S_2$ for the two-pairs pattern (AB, BC), all $S_j$ with j ≥ 3 consisting of three or
more pairs. Let $a_j$ be the number of points and $b_j$ the number of pairs in $S_j$. Then

$$a_1 = 2, \quad b_1 = 1, \quad b_2 = 2, \quad b_j \ge 3 \ \text{ if } j \ge 3. \tag{25}$$

Consider a pattern P composed of $\gamma_1$ simple patterns $S_1$, $\gamma_2$ simple patterns $S_2$, etc., and
containing a points and b pairs. Then, writing symbolically

$$P = \sum_j\gamma_j S_j,$$

we have $a = \sum\gamma_j a_j$, $b = \sum\gamma_j b_j$.

In virtue of (24), $3b - 2a = \sum\gamma_j(3b_j - 2a_j) \ge \sum\gamma_j(b_j - 2)$,

and from (25)

$$3b - 2a \ge -\gamma_1, \tag{26}$$

the sign of equality holding if, and only if, pattern P contains no other simple patterns than
$S_1$ and $S_2$.

14. $(p')^{\mu}$ is the probability that μ pairs of elements drawn from the sample, replacing
each pair after drawing, are all concordant. We may write

$$(p')^{\mu} = \sum_i A_i w'_i,$$

where $A_i$ is the probability that μ pairs are drawn from the sample in such a way that the
system of different pairs among them has the pattern $P_i$, and if $a_i$ is the number of points
in $P_i$, $w'_i$ is the probability that, if $a_i$ elements are drawn from the sample without replacement
and paired according to pattern $P_i$, all pairs of $P_i$ are concordant. The summation is extended
over all patterns with no more than μ pairs.

Since the probabilities $w'_i$ are of the type for which formula (9) is applicable, we have

$$E(p')^{\mu} = \sum_i A_i w_i, \tag{27}$$

where, as usual, $w_i$ is the population probability corresponding to the sample probability $w'_i$.

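15. Consider a term Aw in (27) corresponding to the pattern

$$P = \sum_j\gamma_j S_j$$

with $\gamma = \sum\gamma_j$ simple patterns, $a = \sum\gamma_j a_j$ points and $b = \sum\gamma_j b_j$ pairs.

Let

$$\bar P = \sum_{j\ge 2}\gamma_j S_j$$

be the pattern obtained from P by excluding the single-pair patterns $S_1$. Then

$$\bar\gamma = \gamma - \gamma_1, \quad \bar a = a - 2\gamma_1 \quad\text{and}\quad \bar b = b - \gamma_1 \tag{28}$$

are the numbers of simple patterns, points and pairs in $\bar P$.

We have

$$w = p^{\gamma_1}\bar w, \tag{29}$$

where $\bar w$ is independent of p and $\gamma_1$, only depending on the pattern $\bar P$.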


16. The probability A will be studied, in the first place, as a function of n and $\gamma_1$, while
its dependence on $\bar P$ will be considered later and only in a special case. It must be borne in
mind that, by (28), γ, a and b also depend on $\gamma_1$.

Let $Q_1, Q_2, \dots, Q_b$ be the pairs of pattern P numbered in some definite order. Suppose
pair $Q_{\beta}$ appears $\mu_{\beta}$ times (β = 1, ..., b). Then

$$\mu_1 + \dots + \mu_b = \mu, \qquad \mu_{\beta} \ge 1 \quad (\beta = 1, \dots, b).$$

Let $P_1, P_2, \dots, P_{\mu}$ be the total set of the pairs drawn, numbered independently of the
order in which they appear. Then $\mu_1$ P's are equal to $Q_1$, $\mu_2$ P's are equal to $Q_2$, etc.

Let B be the probability that among μ pairs drawn from the sample, replacing each pair
after drawing, b pairs are different and arranged according to pattern P, pair $Q_{\beta}$ appearing
$\mu_{\beta}$ times (β = 1, ..., b) and the μ pairs being drawn in a definite order, say $P_1, P_2, \dots, P_{\mu}$.
Suppose $P_1$ is a $Q_1$. Since any pair drawn may be taken as $Q_1$ (only the relative position
of the pairs being relevant), the probability first to draw $P_1$ is 1. The probability that the
second pair drawn is $P_2$ depends on whether $P_2$ has no, one or both elements in common
with $P_1$. In the first case, it is $\binom{n-2}{2}\big/\binom{n}{2}$; in the second case, $2(n-2)\big/\binom{n}{2}$ (the factor 2
arising from the fact that each of the two elements of $P_1$ can be the element common with
$P_2$); in the third case, $1\big/\binom{n}{2}$.

In general, if the first λ pairs drawn are $P_1, \dots, P_{\lambda}$, and if they form a pattern P' containing
a' different elements, the probability that the (λ + 1)th pair drawn is $P_{\lambda+1}$ depends on whether
$P_{\lambda+1}$ has no, one or both elements in common with P'. In the first case it is
$\binom{n-a'}{2}\big/\binom{n}{2}$; in the second case, $c'(n-a')\big/\binom{n}{2}$; and in the third case, $c''\big/\binom{n}{2}$, where c' and c'' are
independent of n. If, in the last case, $P_{\lambda+1}$ is equal to one of the preceding P's, c'' = 1.
B is the product of all such probabilities, and it is seen from the above consideration
that it is of the form

$$B = C\,(n-2)^{[a-2]}\Big/\binom{n}{2}^{\mu-1},$$

where C is independent of n.

We also see that a pair which has already appeared before makes no contribution to C.
Hence, C only depends on the different pairs of pattern P, and is independent of the
numbers $\mu_1, \dots, \mu_b$.

The above reflexion further shows that for any simple pattern contained in P, the pair
drawn first, having no elements in common with the preceding pairs, contributes to C the
factor $\tfrac12$, except for the first pair, which yields the factor 1. Thus, C contains the factor
$2^{-\gamma+1}$. $2^{-\gamma_1}$ is obviously the sole contribution to C from the $\gamma_1$ single pairs
(pattern $S_1$) contained in P. Hence

$$B = 2^{-\gamma_1}C'\,(n-2)^{[a-2]}\Big/\binom{n}{2}^{\mu-1},$$

where C' is independent of n and $\gamma_1$ and also independent of the order in which the
single pairs are drawn.

A, the probability that μ pairs drawn form pattern P, irrespective of the order in which
they appear, depends on n in the same way as B. As a function of $\gamma_1$, A contains, besides
$2^{-\gamma_1}$, the factor $1/\gamma_1!$ owing to the fact that the $\gamma_1$ single pairs are interchangeable. Further
it contains the factor $S'_b(\mu)$, which indicates the number of ways of allocating μ objects on
b places so that no place remains empty. In virtue of (23) we have

$$S'_b(\mu) = b!\,d^{(0)}_{\mu,\mu-b}.$$

Thus, A is of the form


where 




and D' is independent of both n and $\gamma_1$ and only depends on the pattern $\bar P$ containing no $S_1$.
Inserting (29) and (30) in (27), we have 

^ Eipy =-E Difipn v{n - 2)t2ri+5-2i, (32) 

the summation taking place over aU patterns with no more than fi pairs. (32) also holds 
for = 0 if by a ‘pattern of 0 pairs ’ we understand the case ~ ^ and take (31) 

as definition of with suitably chosen B'. with 0 is defined by 

pMl(p + 5)W] = 1. 


17. If mAp') = E[p'-pY, 

■Applying (32) with p^^v-d, y^ = k-8, 

we have for the coefficient of p'^v in ^p^{p') 

Jo ^(s) ( 2 ) ' ~ 2)[5-^2-2J-21 


E[p'y-r 


Jo ^ i ^ C) 
Inserting here, in accordance with (16),

id-p 

(r=0 ’ 

V /v\ 1 S- /P\ 2 S~p 

we have S (- 1 )*L ts S S 

s =0 \OJ ^ p=0 \ p / (7=0 

Putting a = a + 2K—2—p—a‘, we have for the coefficient of p^v[n~2f‘*^ in 

( 2 ) 4”’“’= (34) 

where a<;.-)(5) = (-1)^ Q4n*^,if,U-p-2- (35) 

Since = 0ifa + 2A:—a—/?“-2<0, the upper limit of p in the summation 

may be taken as a + 2 a: — a — 2 , which is independent of d. We have then, in virtue of (31), 

1 _ a+ 2 /c-a -2 m 

where D' is independent of S. 

In virtue of (I) and (II), para. 10 , ^ polynomial in d. The degree of the {p + l)th 

term in the sum in (36) is /)+ 2(^4-2a:— a-/?-2), which is highest for /) — 0 . Hence, the 
degree d of is 

d = 6-4-2(v — 6 — A:) + 2(a + 2A: —a—2) = 2(v4-a4- a: — oc—2)“&. (37) 

Now, according to (34) and (14), - 0 if d < v, or, in virtue of (37), 

= 0 if 2a>v--4 + 2d —6 + 2a:, (38) 

Applying (26) for pattern P, we have, since :s= 0 , 2 a^ 3&, and consequently 

2a—6 4“ 2/c ^ 2(6 H- a:), 

the sign of equality holding if and only if pattern P contains no other simple patterns 
than and 

Remembering that, according to para. 16, b we have in virtue of (33) 

5 + a: = 6 + 71+^ = b-{^S^fc^S = V. - (39) 

Thus in any case 2 d— 6 + 2a: < 2 v 

and, in virtue of (38), = 0 if a^fv—1. (40) 

If P contains at least one simple pattern with more than two pairs, we even have 

2 d- 64 - 2 A:< 2 v, 

and consequently = 0 if —2. (41) 

Trom (40) it appears that the degree in of L 1 f^y(p') is 


= |v-2 if V = 26., 

<3^—1 —f if ^ — 27^+1. 

Thus, in P 2 fi-\^iiP')y if expanded in powers of n, the degree of the highest term is 

<3/i-l-46 = -A— 1 . 
In virtue of (13), the degree of the highest term is -1, provided that

Hence, the degree of _ 

°<-2h+l\P ) - 

is < - A -1 + A + = — J. It follows that 

$$\alpha_{2h+1}(p') \to 0 \quad \text{if } k - p^2 > 0. \tag{42}$$

($k - p^2 < 0$ is impossible, since in this case var(p') would become < 0 for large n.)


18. As we have seen, we may write 

!vi\ 2 ?t~l 

( 2 ) + iJ^(« - +.... 


Then it follows from (41) that only contains terms depending on patterns S-^ and 8^, 
that is, A is of the form 

= (43) 

K A 

The only terms 1 2 1 which can contribute to this sum are of the form 

- 2prrW--2j^ 

The pattern corresponding to such a term is 

P - 71^1 +A/Sg, 

Remembering the considerations in para. 16, we see that in each $S_2$ the pair drawn first
contributes to C the factor $\tfrac12$ (except if it is $P_1$), while the first drawing of the other pair
yields the factor 2. Hence, the $S_2$'s make no contribution to C, and we have


The contribution of the patterns $S_2$ to A is twofold: since in each $S_2$ the two pairs may be
interchanged, this gives the factor $(1/2!)^{\lambda}$; and since the λ patterns $S_2$ may be interchanged,
we have the factor $1/\lambda!$. Thus




(yi + 2A)! 


and, in virtue of (31), since b = 2A, 

X)' = — I — 
2^-iAr 


(44) 


Inserting in (38) v = 2h, a-Zh-%, a = 3A, h = 2A, 
we see that 4= 0 is possible only if 


(45) 



/c + 2A>2A. 


On the other hand, from (39), 

K + 2X^2h. 


Hence, 

k = 21i — 2A. 

(46) 

Inserting this in (43), we have 

\—0 


Ph 

(47) 




^ s (■ 




where 
According to (37) in connexion with (45), (46), the degree in S of is 2h. The

highest term, is contained in the term corresponding to p = 0 in (36). Inserting in 

(36) the values from (44), (46) and (46) and putting in the sum p = 0, we have 


>1 ^Sih-S.O^S.h- 


A! 

Thus, in virtue of (16) and (II), para. 10, 

1 oaft- 2 A 

" AJ^ ^ . 2 ^-^(A-A)! 


A- 2 ^) 

A • 


(- 1 )^-^^ 


2^-^hl U/‘ 


According to (14), 


^(2A.3A-2^ (-l)^-^(2fe)! /A 




U/‘ 


Inserting this in (47) we have 

i2h)\ ft 


2 ft-%!A=o 
The highest term of i^znip') is thus 


{2h)\ 


2 ft-%!A' ^ Ur 2 ft~iAr* 


2 ft®(ifc-^)a)ftn-ft, 


M 


that of —p^) and hence 






if ^“-^^>0, 


(48) 


From (42) and (48) it follows according to the Second Limit Theorem that the distribution
of p' tends to normality as n → ∞, provided that the marginal distributions are continuous
and $k - p^2 > 0$.

The condition $k - p^2 > 0$ is fulfilled if the population is distributed normally. For,
comparing Esscher's formula (5) with (13), we find, since var(τ') = 4 var(p'),

$$k - p^2 = \frac{1}{36} - \Big(\frac{1}{\pi}\sin^{-1}\frac{\rho}{2}\Big)^2.$$

The right-hand side is positive if |ρ| < 1.

V. The variance of p' in the case of a finite population

19. Consider a sample of n drawn from a finite population of N in which all values of x
and all values of y are different. For the sample probabilities p', k', (p²)', ... we write now

$$p^{(n)},\ k^{(n)},\ (p^2)^{(n)},\ \dots,$$

and for the corresponding population probabilities

$$p^{(N)},\ k^{(N)},\ (p^2)^{(N)},\ \dots.$$

Equation (9) remains valid and may be written as follows:

$$E\,w^{(n)} = w^{(N)}. \tag{49}$$

In particular, $E\,p^{(n)} = p^{(N)}$.

The essential difference between this case and the case N = ∞ considered above is that the
composite probabilities such as $(p^2)^{(N)}$ or $(pk)^{(N)}$ are not equal to $(p^{(N)})^2$ or $p^{(N)}k^{(N)}$. For
instance, $(p^{(N)})^2$ is evidently the same function of $p^{(N)}$, $k^{(N)}$, $(p^2)^{(N)}$ and N as $(p')^2$ is of p', k',
$(p^2)'$ and n. Thus we have, replacing n by N in (10),



(60) 

Ar[ 2 ] 4 2 

•nd hence (J'P-(/jp,(!"''>)’ 

(61) 

On the other hand, from ( 10 ) and (49) 



(62) 

which is the equivalent of equation ( 11 ). 

On subtracting (50) from (52) we find 



-[2i';rw-3(JV+.w-l)](^32)<^^}. (53) 

Substituting for the expression in (61), we obtain 


+ 2(n -2){N-2) {U^>-pm^)l (54) 

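or

$$\binom{n}{2}\operatorname{var}(p^{(n)}) = \Big\{1 - \binom{n-2}{2}\Big/\binom{N-2}{2}\Big\}\,p^{(N)}(1-p^{(N)}) + 2(n-2)\Big\{1 - \frac{n-3}{N-3}\Big\}\big(k^{(N)} - (p^{(N)})^2\big). \tag{55}$$

For N → ∞, (55) becomes the same as (13).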
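The finite-population variance can be checked by enumerating every sample. The sketch below is an illustration under stated assumptions: the population is given as a list of y-ranks in the natural x-order, $k^{(N)}$ is taken to be the probability that a member A is concordant with both of two other members drawn without replacement, and the closed form coded in `variance_by_formula` is the reading of (55) in which the two terms carry the factors $1 - \binom{n-2}{2}/\binom{N-2}{2}$ and $1 - (n-3)/(N-3)$. The example ranking and all function names are hypothetical choices for the demonstration.

```python
from fractions import Fraction
from itertools import combinations
from math import comb


def concordant(i, j, ranks):
    """Members i and j (in natural x-order) agree in order on the y-ranks."""
    return (i - j) * (ranks[i] - ranks[j]) > 0


def p_stat(members, ranks):
    """Proportion of concordant pairs among `members`."""
    pairs = list(combinations(members, 2))
    return Fraction(sum(concordant(i, j, ranks) for i, j in pairs), len(pairs))


def k_stat(members, ranks):
    """Probability that A is concordant with both B and C (no replacement)."""
    m = len(members)
    total = 0
    for a in members:
        c = sum(concordant(a, b, ranks) for b in members if b != a)
        total += c * (c - 1)
    return Fraction(total, m * (m - 1) * (m - 2))


def variance_by_enumeration(ranks, n):
    """Exact variance of p over all samples of n from the population."""
    values = [p_stat(s, ranks) for s in combinations(range(len(ranks)), n)]
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values) / len(values)


def variance_by_formula(ranks, n):
    """Closed form assumed for (55); requires N >= 4."""
    N = len(ranks)
    everyone = range(N)
    p, k = p_stat(everyone, ranks), k_stat(everyone, ranks)
    first = (1 - Fraction(comb(n - 2, 2), comb(N - 2, 2))) * p * (1 - p)
    second = 2 * (n - 2) * (1 - Fraction(n - 3, N - 3)) * (k - p * p)
    return (first + second) / comb(n, 2)
```

For n = 2 the formula reduces to $p^{(N)}(1-p^{(N)})$, the variance of a single concordance indicator, which is a convenient sanity check.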

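VI. A sample estimate of var (p')

20. In the case of an infinite population, let

$$\binom{n}{2}\operatorname{var}'(p') = p' + 2(n-2)k' - (2n-3)(p^2)'. \tag{56}$$

Then, in virtue of (9) and (12),

$$E\operatorname{var}'(p') = \operatorname{var}(p').$$

On inserting in (56) for $(p^2)'$ the expression obtained from (10), we find

$$\binom{n-2}{2}\operatorname{var}'(p') = p'(1-p') + 2(n-2)\big(k' - (p')^2\big),$$

or

$$\operatorname{var}'(p') = \frac{2}{(n-2)(n-3)}\Big\{p'(1-p') + 2(n-2)\big(k' - (p')^2\big)\Big\}. \tag{57}$$

By analogy,

$$\operatorname{var}'(q') = \frac{2}{(n-2)(n-3)}\Big\{q'(1-q') + 2(n-2)\big(l' - (q')^2\big)\Big\}, \tag{58}$$

where l' is the probability that among three members A, B, C drawn from the sample without
replacement, one, say A, is discordant with the other two.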

In the case of a finite population of the type considered in para. 19, we define in a similar
way a statistic $\operatorname{var}^{(n)}(p^{(n)})$ such that

$$E\operatorname{var}^{(n)}(p^{(n)}) = \operatorname{var}(p^{(n)}).$$

We find

var(«) = W(ir:TK^-r^33)((^ + (1 ~2){N- 2) -^)W“)}. 

(59)

A comparison between (59) and (54) shows that $\operatorname{var}^{(n)}(p^{(n)})$ is obtained from $\operatorname{var}(p^{(n)})$
by interchanging n and N and taking the opposite sign.


21. Let $c_{\nu}$ and $d_{\nu}$ be the numbers of sample members concordant and discordant with
$A_{\nu} = (x_{\nu}, y_{\nu})$ (ν = 1, ..., n). The probability of drawing first the member $A_{\nu}$, and then,
without replacing it, a member concordant with $A_{\nu}$ is $c_{\nu}/\{n(n-1)\}$. The probability of
drawing, without replacement, first $A_{\nu}$ and then two other members concordant with $A_{\nu}$ is
$c_{\nu}(c_{\nu}-1)/\{n(n-1)(n-2)\}$. Hence

$$p' = \frac{\sum c_{\nu}}{n(n-1)}, \qquad k' = \frac{\sum c_{\nu}(c_{\nu}-1)}{n(n-1)(n-2)}. \tag{60}$$

Similarly,

$$q' = \frac{\sum d_{\nu}}{n(n-1)}, \qquad l' = \frac{\sum d_{\nu}(d_{\nu}-1)}{n(n-1)(n-2)}. \tag{61}$$

If only the value of p' or q' is required, the use of (6) may be more expedient than that of
(60) or (61). If, however, the variance, and hence k' or l', is wanted, the calculation by means
of the numbers $c_{\nu}$ and $d_{\nu}$ (whose sums are twice the numbers of concordant and discordant
pairs) according to (60) or (61) is to be preferred.

If p' > ½, it is more convenient to calculate q' and l' from (61); if p' < ½, the calculation of
p' and k' by (60) is more rapid. In many cases one can see directly from the given data
whether the concordant or the discordant pairs prevail, before actually calculating p' or q'.
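The calculation described in this paragraph can be sketched as follows. The code is an illustration, not the author's procedure: it counts for every sample member the number of other members concordant with it, forms p' and k' from those counts, and then evaluates the variance estimate in the form $\frac{2}{(n-2)(n-3)}\{p'(1-p') + 2(n-2)(k' - p'^2)\}$, which is the reading of (57) assumed here. The function names and the example data are hypothetical.

```python
from fractions import Fraction


def concordance_counts(xs, ys):
    """For each member, the number of other members concordant with it."""
    n = len(xs)
    return [sum(1 for j in range(n)
                if j != i and (xs[i] - xs[j]) * (ys[i] - ys[j]) > 0)
            for i in range(n)]


def estimate(xs, ys):
    """Return (p', k', estimated var(p')); assumes no ties and n >= 4."""
    n = len(xs)
    c = concordance_counts(xs, ys)
    p = Fraction(sum(c), n * (n - 1))
    k = Fraction(sum(ci * (ci - 1) for ci in c), n * (n - 1) * (n - 2))
    var_p = Fraction(2, (n - 2) * (n - 3)) * (p * (1 - p) + 2 * (n - 2) * (k - p * p))
    return p, k, var_p
```

Since the rank correlation is 2p' − 1, an estimate of its variance follows on multiplying the returned variance by 4.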


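Since p' + q' = 1, we have var(p') = var(q'),

and also, in the case of a finite population,

$$\operatorname{var}(p^{(n)}) = \operatorname{var}(q^{(n)}).$$

If we write down the equation for $\operatorname{var}(q^{(n)})$ analogous to (55) and subtract it from (55),
we have $k^{(N)} - (p^{(N)})^2 - l^{(N)} + (q^{(N)})^2 = 0$,

or $k^{(N)} - l^{(N)} = (p^{(N)})^2 - (q^{(N)})^2 = p^{(N)} - q^{(N)}$.

Substituting n for N, we have

$$k' - l' = p' - q' = \tau'.$$

Comparing this with (57) and (58) we see that

$$\operatorname{var}'(p') = \operatorname{var}'(q').$$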
THE SIGNIFICANCE OF RANK CORRELATIONS WHERE
PARENTAL CORRELATION EXISTS

By H. E. DANIELS (Wool Industries Research Association)

AND M. G. KENDALL


1. All the known tests of significance of rank correlation coefficients are based on
distributions from a population in which each possible ranking occurs equally frequently,
i.e. the null case where no parental correlation exists. We may then say of any particular
coefficient whether it is significant in the sense that it cannot have arisen with any acceptable 
probability from an uncorrelated population. No tests are known in the case where parental 
correlation exists, and we have not seen the point discussed except in reference to the 
replacement of rank correlations by grade or product-moment correlations. Thus, for 
example, if two rank correlation coefficients are both found to be significant there has 
hitherto been no exact method of deciding whether their difference is significant. In this 
paper we consider the problem of determining confidence intervals for a rank correlation 
when the parent is correlated and develop a test of significance for the difference of two 
correlations. 

2. In testing an ordinary product-moment correlation the problem is enormously 
simplified by the assumption that the population is normal, or the further assumption that 
normal theory holds good even when the parent deviates only moderately from normality. 
Apart from means and variances the population is then completely specified by the single
parent parameter ρ and, as is well known, the sample distribution of the estimator depends
only on ρ and the sample number n.

In ranking theory this position no longer obtains. No assumption can in general be made 
about the form of the parent distribution and, in particular, the parent correlation does not 
completely specify the problem. The usual type of variate theory cannot, therefore, be 
expected to meet the requirements. 


3. A satisfactory approach to the problem can, however, be made if the rank correlation
is measured by the coefficient known as τ (Kendall, 1943, chap. 16). We shall then show that,
for large samples at any rate, the problem admits of a solution.

Let the population consist of N members. They may be imagined as laid out in the natural
order 1, 2, ..., N according to the first variate. The rankings according to the second variate
are then some permutation of the numbers 1 to N, and this second array of ranks is all we
need write down in particular cases. It determines the rank correlation τ. Now suppose
we choose a sample of n in one of the $\binom{N}{n}$ possible ways. This sample will, so far as the first
variate is concerned, be in the natural order, and the ranks according to the second variate
permit of the calculation of a sample correlation t. For all possible samples and any given
arrangement of the parent members there will be a distribution of $\binom{N}{n}$ values of t.
4. The sample value of t is an unbiased estimator of τ; that is to say, the mean value of
t in all possible samples is τ. For consider the $\binom{N}{n}$ samples of n. Any particular pair of
members occurs in $\binom{N-2}{n-2}$ samples, that is, all pairs occur equally frequently in the
totality of all samples. In calculating t we assign to any pair +1 if its members are in the
right order and -1 in the contrary case. Thus the total of the score for all samples is $\binom{N-2}{n-2}$
times the score for the population. To obtain t we divide the score for any sample by ½n(n - 1),
and to obtain τ we divide the population score by ½N(N - 1). Hence if S is the score for the
population, the mean value (expectation) of t is

$$E(t) = \frac{\binom{N-2}{n-2}S}{\binom{N}{n}\,\tfrac12 n(n-1)} = \frac{S}{\tfrac12 N(N-1)} = \tau. \tag{4.1}$$

5. Unfortunately, it is not true that higher moments of t depend only on τ. A single
example will illustrate the point. Consider the ranking of 9:

5 2 3 1 6 7 8 9 4.

If the 84 = $\binom{9}{3}$ possible samples of three are written down and t evaluated for each, the
distribution of S (the number of positive pairs) is found to be as follows:

Values of S    0    1    2    3    Total
Frequency      2   15   34   33     84

The mean of this distribution is 182/84 = 13/6, and since

$$t = \frac{2S}{\tfrac12 n(n-1)} - 1,$$

the mean value of t is (26/18) - 1 = 0.44. The value of S for the parent ranking is 26 and hence
τ = (52/36) - 1 = 0.44, verifying equation (4.1). The ranking

1 2 5 9 3 6 7 8 4

also has τ = 0.44, but the distribution of S in samples of three is now:

Values of S    0    1    2    3    Total
Frequency      3   16   29   36     84

The second moment of this distribution is 5.429, against 5.333 for the first distribution, the
variances being 0.734 against 0.639.
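The two frequency tables can be reproduced by listing all 84 samples of three. The following sketch is an illustration with assumed function names: it counts the positive pairs in every sample (the sample keeps the parent order), tabulates the distribution of S, and converts S to t by t = 4S/(n(n - 1)) - 1.

```python
from collections import Counter
from fractions import Fraction
from itertools import combinations


def positive_pairs(ranks):
    """Number of pairs whose second-variate ranks are in the right order."""
    return sum(1 for a, b in combinations(ranks, 2) if a < b)


def sample_distribution(parent, n):
    """Frequency of S over all samples of n; combinations() keeps parent order."""
    return Counter(positive_pairs(s) for s in combinations(parent, n))


def t_from_pairs(s, n):
    """Rank correlation from the number of positive pairs among n members."""
    return Fraction(4 * s, n * (n - 1)) - 1


def mean_and_variance_of_s(parent, n):
    """Exact mean and variance of S over all samples of n."""
    dist = sample_distribution(parent, n)
    total = sum(dist.values())
    mean = Fraction(sum(s * f for s, f in dist.items()), total)
    second = Fraction(sum(s * s * f for s, f in dist.items()), total)
    return mean, second - mean ** 2
```

Both rankings give the same mean of S, and hence the same mean of t, while the variances differ, which is the point of the example.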

6. Thus for any parent with given τ there is in general more than one sampling distribution
of t according to the arrangement of the parent ranks. In short, as mentioned above, the
parameter τ does not completely specify the sampling distribution and in asking the question:
What is the standard error of t? we are seeking for an answer which does not exist.

It will be shown, however, that for any given parent ranking the distribution of t tends to
normality with increasing n. The sampling properties of t can therefore be specified to a
first approximation by its first and second moments only, when the samples are not too small.
Further, it will be proved that for given τ the variance of t cannot exceed a certain function
of τ and n whatever the parent ranking. From a knowledge of t and n only, it is thus possible
to set outer bounds to confidence intervals for τ provided n is large enough for the normal
approximation to hold. The limits obtained in this way are sometimes rather wide, and an
alternative procedure is to estimate the true variance of t directly from the sample itself
according to a formula given below. This avoids the loss of efficiency consequent on using an
upper limit to the variance, but it is not known how large a sample is required for the error
of estimation to be tolerable.


7. The development of the theory is facilitated if we introduce at the present stage a
notation similar to that used by Daniels (1944). The ith and jth ranks corresponding to the
second variate are together assigned a score $a_{ij}$ which takes the value +1 if the members
are in the correct order, -1 if in the wrong order, and $a_{ii}$ is defined to be zero. The ranks for
the first variate are similarly assigned scores $b_{ij}$, but as the members have been taken in the
correct order for this variate, the scores are simply $b_{ij} = \pm 1$, $i \gtrless j$; $b_{ii} = 0$. Next we define
$c_{ij} = a_{ij}b_{ij}$, so that $c_{ij} = \pm 1$, according to whether the ranks for the two variates agree or
differ in order, and $c_{ii} = 0$. In this notation

$$\tau = \frac{c}{N(N-1)},$$

where $c = \sum c_{ij}$, i and j both being summed from 1 to N.

When the sample of n pairs is selected at random from the parent N and its coefficient
t is calculated, the values of $c_{ij}$ for the members of the sample remain the same as in the
population. This fact makes τ much more suitable for the present problem than the Spearman
coefficient ρ whose associated scores do not possess the same property. The sample rank
correlation is then

$$t = \frac{1}{n(n-1)}\sum c_{ij},$$

where the summation here extends only over those values of i and j occurring
in the sample.


8. It has already been proved that E(t) = τ. To find the variance of t we require E(t²),
so consider

$$\sum_n\Big[\sum c_{ij}\Big]^2,$$

the inner sum extending over the members of the sample and $\sum_n$ denoting summation over
all selections of the sample of n from the finite parent population of N members. Let us
enumerate the number of ways in which $c_{ij}c_{kl}$ and similar products with 'tied' suffixes,
such as $c_{ij}c_{il}$, occur in the sum.

(i) When i, j, k, l are all different the term $c_{ij}c_{kl}$ occurs with $\binom{N-4}{n-4}$ selections of the
remaining members of the sample and the contribution of such terms to $\sum_n$ is $\binom{N-4}{n-4}\sum' c_{ij}c_{kl}$,
$\sum'$ meaning summation over all unequal values of i, j, k, l from 1 to N.

(ii) The term $c_{ij}c_{il}$ similarly occurs in $\binom{N-3}{n-3}$ ways and there are four ways of tying one
suffix, each of which gives the same contribution to $\sum_n$ since $c_{ij}$ is symmetrical. The total
contribution of such terms to $\sum_n$ is therefore $4\binom{N-3}{n-3}\sum' c_{ij}c_{il}$.

(iii) Terms like $c_{ij}^2$ similarly contribute $2\binom{N-2}{n-2}\sum' c_{ij}^2$ to $\sum_n$, and all other terms are
zero since $c_{ii} = 0$. Hence

$$\sum_n\Big[\sum c_{ij}\Big]^2 = \binom{N-4}{n-4}\sum{}' c_{ij}c_{kl} + 4\binom{N-3}{n-3}\sum{}' c_{ij}c_{il} + 2\binom{N-2}{n-2}\sum{}' c_{ij}^2.$$

Expressing the $\sum'$'s in terms of the corresponding $\sum$'s and dividing out by $\binom{N}{n}$ we obtain

$$E[n(n-1)t]^2 = \frac{n^{(4)}}{N^{(4)}}\Big(c^2 - 4\sum c_i^2 + 2\sum c_{ij}^2\Big) + \frac{4n^{(3)}}{N^{(3)}}\Big(\sum c_i^2 - \sum c_{ij}^2\Big) + \frac{2n^{(2)}}{N^{(2)}}\sum c_{ij}^2,$$

where $n^{(r)} = n(n-1)\cdots(n-r+1)$. Since $\sum c_{ij}^2 = N(N-1)$ and $c = N(N-1)\tau$, the variance of
t for given τ and n is seen to depend on the value of $\sum c_i^2$, where $c_i = \sum_j c_{ij}$.

Let N become large. The quantities c and $\sum c_i^2$ are respectively $O(N^2)$ and $O(N^3)$, so if we
introduce $\tau_i = c_i/N$ the value of E(t²) for large N becomes

$$E(t^2) = \frac{(n-2)(n-3)}{n(n-1)}\tau^2 + \frac{4(n-2)}{n(n-1)}\frac{\sum\tau_i^2}{N} + \frac{2}{n(n-1)},$$

and hence in the limit the variance of t is

$$\operatorname{var} t = \frac{4(n-2)}{n(n-1)}\operatorname{var}\tau_i + \frac{2}{n(n-1)}(1-\tau^2). \tag{8.1}$$


9. The variance of t satisfies the inequality

$$\operatorname{var} t \le \frac{2}{n}(1-\tau^2), \tag{9.1}$$

whatever the parent ranking. Moreover, though the limit may not be attained in any
particular parent ranking, reasons are given in the Appendix for expecting that it cannot
be substantially improved upon. The proof is as follows.

Reverting to a finite parent population of N members, we first seek a maximum for $\sum c_i^2$.
In terms of the original scores, $c_i = \sum_j a_{ij}b_{ij}$. Keeping $b_{ij} = \pm 1$, $i \gtrless j$, $b_{ii} = 0$, as before, allow
the $a_{ij}$ to assume any values subject to the conditions

$$\sum a_{ij}^2 = N(N-1), \qquad \sum a_{ij}b_{ij} = c.$$

The stationary values of $\sum c_i^2$ occur when the $a_{ij}$'s satisfy the equations

$$(c_i + c_j)\,b_{ij} = \lambda a_{ij} + \mu b_{ij},$$

which give, on multiplying by $b_{ij}$ and summing over j,

$$(N - 2 - \lambda)\,c_i = \mu(N-1) - c.$$

Thus, unless the $c_i$'s are all to be equal, in which case $\sum c_i^2$ is a minimum, λ and μ must take
the values λ = N - 2, μ = c/(N - 1),

and since $2\sum c_i^2 - \lambda N(N-1) - \mu c = 0$,

it follows that $\sum c_i^2$ cannot exceed $\tfrac12 N(N-1)(N-2) + \tfrac12 c^2/(N-1)$. Allowing N to become
large, this implies $\sum\tau_i^2/N \le \tfrac12(1+\tau^2)$.

Hence $\operatorname{var}\tau_i \le \tfrac12(1-\tau^2)$,

and so from equation (8.1)

$$\operatorname{var} t \le \frac{2}{n}(1-\tau^2). \tag{9.1}$$


10. Assuming that the sample is large enough for the distribution of t to be normal, the
roots $\tau_1$, $\tau_2$ of the equation

$$(t-\tau)^2 = \frac{2x^2}{n}(1-\tau^2), \tag{10.1}$$

i.e.

$$\tau = \frac{t \pm x\sqrt{2/n}\,\sqrt{1 - t^2 + 2x^2/n}}{1 + 2x^2/n}, \tag{10.2}$$

provide confidence limits to τ when t is known, x being the standardized normal deviate
corresponding to a given probability of P %. These confidence limits are of course maxima,
in the sense that we shall be wrong in at most P % of the cases in asserting τ to lie between
the calculated limits.
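The outer confidence limits can be computed directly. The sketch below rests on an assumption about the form of the equation being solved: taking the bound var t ≤ (2/n)(1 − τ²) as if it were attained, the limits are the roots in τ of (t − τ)² = x²(2/n)(1 − τ²). A second helper applies the same bound on the arcsine scale, where the standard error does not exceed √(2/n). The function names are illustrative.

```python
import math


def tau_confidence_limits(t, n, x):
    """Roots in tau of (t - tau)^2 = x^2 * (2/n) * (1 - tau^2)."""
    a = 2.0 * x * x / n
    half_width = x * math.sqrt(2.0 / n) * math.sqrt(1.0 - t * t + a)
    return (t - half_width) / (1.0 + a), (t + half_width) / (1.0 + a)


def arcsine_limits(t, n, x):
    """Limits for tau from w = arcsin(t) with standard error at most sqrt(2/n)."""
    w = math.asin(t)
    spread = x * math.sqrt(2.0 / n)
    lower = max(w - spread, -math.pi / 2)
    upper = min(w + spread, math.pi / 2)
    return math.sin(lower), math.sin(upper)
```

For example, `tau_confidence_limits(0.4, 30, 1.96)` gives the conservative 5 % limits for a sample of thirty with t = 0.4. Because the variance bound is an upper limit, these intervals are at least as wide as those based on the true variance.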

In our proof of the tendency of t to normality it will be necessary to neglect terms of order
$n^{-1/2}$, and the sample may have to be rather large for such terms to be small, unless τ itself
is small.

The form of equation (9.1) suggests using

$$w = \sin^{-1} t$$

instead of t. To the same order of approximation we can take w as having a normal
distribution with mean $\sin^{-1}\tau$ and standard error not exceeding $\sqrt{2/n}$, which is
independent of τ. This form is more convenient for assigning confidence limits to τ, and for testing
the significance of the difference between $t_1$ and $t_2$ (whose standard error cannot exceed
$\sqrt{2(1/n_1 + 1/n_2)}$), but we have not been able to discover whether the transformation brings
the distribution nearer to normality.


11. We now prove that the distribution of t tends for large n to normality whatever the
parent ranking, provided that |τ| is not near unity.

Write so that = 0 , and g^^ -cjN^ = - (iV- The 

rth moment of about its mean value is consider 

n n 

the summation S being over all possible sample selections. 


The argument used by Daniels (1944) to show that in the null case the distribution of rank
correlation in large samples tends to normality can be applied with little modification to the
present problem. The proof is therefore sketched here without much detail.

Two essential conditions to be satisfied are that $\sum g_{ij} = 0$, which is true by definition, and
a second condition which is true only if $1 - \tau^2 = O(1)$, so that the tendency to normality may
be expected to break down for high correlations.

The sum S is evaluated as in § 8 by counting the number of ways in which terms like 

n 

and similar terms with tied suffixes, occur. In this way it is expressed as a linear 
combination of Every such S' is replaceable by the corresponding S 

together with terms containing more tied suffixes which are of lower order in N since they
involve fewer summations from 1 to N.


12. First consider the even moments with r = 2m. Terms containing more than 3m
different suffixes must vanish, since in such cases it is impossible to avoid at least one $g_{ij}$
with two free suffixes, and $\sum g_{ij} = 0$. For the same reason the only non-vanishing terms with
3m different suffixes are those containing expressions like

*** ” » 

and terms with fewer different suffixes are of correspondingly lower order in N.

With 3m suffixes assigned there are $\binom{N-3m}{n-3m}$ ways of selecting the remaining n - 3m
members of the sample, and the suffixes can be tied in

$$\frac{(2m)!\,2^{2m}}{m!\,(2!)^m}$$

ways to give the same result.


Dividing out by and noting that j j ^ ^hen both N and n are large, 

the contribution of such terms to /^ 2 m> 2 mth moment of about its mean, is found to be 

^ 3 m ( 2 m)! 

which is of order Moreover, by the same argument, terms with/< 3 m different suffixes 
add contributions of order which may be neglected. 

( 2 m)!, 


Hence 




the neglected terms being relatively 0 ( 7 i“^). 




13. For the odd moments let r = 2m + 1. Similar considerations show that the
non-vanishing terms of $\sum_n$ cannot have more than 3m + 1 different suffixes, and $\mu_{2m+1}$ is therefore
of order $n^{3m+1}$.

Then since has even moments of unit order and odd moments of order the odd 
moments may be neglected to that order. We conclude that is distributed normally for 
large n with variance 4^3 

Jp^9if9ik = 47i.3varr^, 

and t is similarly normal with variance $(4/n)\operatorname{var}\tau_i$.

14. The fact that terms of order $n^{-1/2}$ have to be neglected suggests that the normal
approximation only holds good for fairly large samples. This is not surprising since one would expect
skewness to be an important property of the distribution of t when τ is not zero, if only for
the reason that |t| can never exceed unity. It seems worth while to examine the odd moments
in more detail.
The dominant term of the (2m + 1)th moment has 3m + 1 different suffixes, which can


occur as 


9ijc 9 ii{'^9u'i} 9u 






Both can be obtained in 


{2m+1)12 ® _ (2m +1)! 2'"+2 
(2!)“-i3!(m-ljT 

'N — 3m — 1 


3!(m —1)! 

ways of selecting the sample with 3m +1 suffixes 


. , j A /N — Sm—1\ 

distinct ways, and there ^re ( _ ^ I 

assigned. The (2m+ l)th moment of about its mean is therefore 

.,.^3?n+i (2m+l)!2™+2 

/^2m+i ^3?n+i 3 !(m — 1)! 

ignoring terms of relative order O(n-r). The corresponding moment of t is obtained to the 
same order on dividing by it depends only on vari and where 


/Igit)> 






JV 


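where $g_i = \sum_j g_{ij}$. The distribution of t is thus specified to $O(n^{-1})$ by its first three moments.
The moment-generating function of the distribution of t in standard measure is

$$M(z) = e^{\frac12 z^2}\Big(1 + \frac{\gamma_1}{6}z^3\Big)\{1 + O(n^{-1})\},$$

where $\gamma_1 = O(n^{-1/2})$,

and the frequency distribution of $x = (t-\tau)/\sqrt{\operatorname{var} t}$ is*

$$f(x) = \Big(1 - \frac{\gamma_1}{6}\frac{d^3}{dx^3}\Big)\frac{e^{-\frac12 x^2}}{\sqrt{2\pi}}\,\{1 + O(n^{-1})\}. \tag{14.1}$$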


15. The effect of the $\gamma_1$ term in modifying the confidence limits based on normal theory can be seen in the following way. Let $\xi$ be the normal deviate whose chance of being exceeded is $P(\xi)$. The chance of $x$ exceeding $\xi$ is, from (14·1),
$$F(\xi) = P(\xi) + \frac{\gamma_1}{6}(\xi^2 - 1)\frac{e^{-\xi^2/2}}{\sqrt{(2\pi)}}.$$
If $X$ is the correct limit such that $F(X) = P(\xi)$, it is readily proved by successive approximation that the formula
$$X = \xi + \frac{\gamma_1}{6}(\xi^2 - 1) \qquad (15\cdot1)$$
gives the appropriate value of $X$ to the same order of approximation. For example, the 5 and 1 % limits are respectively $\pm 1.96 + 0.474\gamma_1$ and $\pm 2.58 + 0.941\gamma_1$.
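The size of the skewness correction is easily checked numerically. The following is a minimal sketch, assuming the correction has the form $X = \xi + \gamma_1(\xi^2 - 1)/6$; the function names are illustrative only and do not come from the paper.

```python
def skewness_coefficient(xi):
    """Multiplier of gamma_1 in the corrected limit X = xi + gamma_1 * (xi**2 - 1) / 6."""
    return (xi * xi - 1.0) / 6.0


def adjusted_deviate(xi, gamma1):
    """Normal deviate xi corrected to first order for skewness gamma1."""
    return xi + gamma1 * skewness_coefficient(xi)


if __name__ == "__main__":
    for xi in (1.96, 2.58):
        print(xi, round(skewness_coefficient(xi), 3))
```

For $\xi = 1.96$ the multiplier agrees with the 0.474 quoted in the text; for $\xi = 2.58$ it comes out close to 0.94.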

16. In practice the value of var $t$ has to be estimated from the sample, and although its standard error can be shown to be of lower order in $n$ than var $t$ itself by the kind of argument already used, it is not known how large the sample has to be before the error in estimating the variance can be safely ignored. It is best to use the unbiased formula
$$\operatorname{var} t = \frac{1}{n(n-1)(n-2)(n-3)}\left\{4\sum_i c_i^2 - 2\sum_{i \neq j} c_{ij}^2 - \frac{2(2n-3)}{n(n-1)}\Big(\sum_i c_i\Big)^2\right\} \qquad (16\cdot1)$$
(which is easily proved) in calculating var $t$ from the sample, especially if the standard error of the mean value of $t$ from a number of small samples is required.
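As a check on the arithmetic, the unbiased estimate can be computed directly from the row totals of the score matrix. The sketch below assumes (16·1) has the form $\{4\sum c_i^2 - 2\sum c_{ij}^2 - 2(2n-3)(\sum c_i)^2/n(n-1)\}/n(n-1)(n-2)(n-3)$, and that there are no ties, so that every off-diagonal $c_{ij}$ is $\pm 1$. The function name is hypothetical.

```python
def unbiased_var_t(sum_ci_sq, sum_ci, n, sum_cij_sq=None):
    """Unbiased estimate of var t from the totals of the score matrix.

    sum_ci_sq  : sum of squared row totals c_i
    sum_ci     : sum of the row totals (twice the score S)
    sum_cij_sq : sum of squared off-diagonal scores; n(n-1) when there are no ties
    """
    if sum_cij_sq is None:
        sum_cij_sq = n * (n - 1)
    numerator = (4 * sum_ci_sq
                 - 2 * sum_cij_sq
                 - 2 * (2 * n - 3) * sum_ci ** 2 / (n * (n - 1)))
    return numerator / (n * (n - 1) * (n - 2) * (n - 3))


if __name__ == "__main__":
    # Totals quoted at the foot of Table 2 for the MA correlation.
    print(round(unbiased_var_t(7470, 426, 30), 6))
```

With the totals printed under Table 2 ($\sum c_i = 426$, $n = 30$, $\sum c_i^2 = 7470$) this reproduces the value of var $t_A$ quoted in § 17.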

* Note that the approximation error in $f(x)$ is relatively $O(n^{-1})$, a stronger result than would be obtained from a Gram-Charlier approximation based on the first three moments only.
As the term in $\gamma_1$ is a small correction it is perhaps sufficient in moderate samples to take

and /‘3(0 = A<^. ri = /«3(0/(’5^a,r0», (16-3) 

where the first term in G is the sum of Cific^ + off over all values of i >j. The unbiased formula- 
for involves some rather tedious computation. 

17. To illustrate the methods of the paper we consider an actual example.

A set of thirty wool samples were visually graded in order of fibre fineness by three assessors. The mean fibre diameter for each wool sample was also determined by direct measurement. Table 1 shows the measured order (M) compared with that of the three assessors (A, B, C), in ascending order of experience.


Table 1

  M    A    B    C   |   M    A    B    C
  1    5    2    1   |  16   12   14   16
  2    4    5    2   |  17   10   18   15
  3    9    6    6   |  18   30   21   25
  4    3    1    3   |  19   22   26   24
  5    6    7    4   |  20   16   22   19
  6    2    4    5   |  21   21   16   18
  7   15   19   10   |  22   29   20   23
  8   18    3   12   |  23   28   25   22
  9    8    8    7   |  24   19   27   26
 10   11    9    8   |  25   23   28   21
 11   17   13    9   |  26   20   23   27
 12   13   10   11   |  27    7   24   20
 13   24   17   17   |  28   26   29   28
 14   14   12   14   |  29   27   15   30
 15    1   11   13   |  30   25   30   29


The method of working will be seen from the matrix for the MA correlation shown in Table 2.

The correlations of the assessors' orders with the measured order are found to be
$$t_A = 0.490, \quad t_B = 0.724, \quad t_C = 0.816.$$
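The method of working can be sketched in a few lines. The example below builds the matrix of scores $c_{ij}$ ($+1$ when a pair is placed in the same order by both rankings, $-1$ otherwise), sums each row to obtain $c_i$, and takes $t = \sum c_i / n(n-1)$. The rankings of assessors B and C are as read from Table 1; the function names are illustrative, and untied rankings are assumed.

```python
from itertools import combinations

# Ranks given by assessors B and C to the samples taken in measured order M = 1..30.
B = [2, 5, 6, 1, 7, 4, 19, 3, 8, 9, 13, 10, 17, 12, 11,
     14, 18, 21, 26, 22, 16, 20, 25, 27, 28, 23, 24, 29, 15, 30]
C = [1, 2, 6, 3, 4, 5, 10, 12, 7, 8, 9, 11, 17, 14, 13,
     16, 15, 25, 24, 19, 18, 23, 22, 26, 21, 27, 20, 28, 30, 29]
M = list(range(1, 31))


def score_matrix(x, y):
    """Symmetric matrix of scores c_ij = +1 or -1, with zero diagonal (no ties assumed)."""
    n = len(x)
    c = [[0] * n for _ in range(n)]
    for i, j in combinations(range(n), 2):
        agree = (x[i] - x[j]) * (y[i] - y[j]) > 0
        c[i][j] = c[j][i] = 1 if agree else -1
    return c


def row_totals(x, y):
    """Row sums c_i of the score matrix."""
    return [sum(row) for row in score_matrix(x, y)]


def rank_correlation(x, y):
    """t = (sum of c_i) / n(n-1), i.e. the score S divided by n(n-1)/2."""
    n = len(x)
    return sum(row_totals(x, y)) / (n * (n - 1))


t_B = rank_correlation(M, B)
t_C = rank_correlation(M, C)

if __name__ == "__main__":
    print(round(t_B, 3), round(t_C, 3))
```

The row totals returned by `row_totals` are the quantities needed for the variance estimate of § 16.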

(i) Consider first the maximum confidence limits given by (10·2). The 5 % limits are
$$-0.02 < \tau_A < 0.80, \quad 0.23 < \tau_B < 0.92, \quad 0.34 < \tau_C < 0.96.$$

Again, using the transformation $w = \sin^{-1} t$, the 5 % limits are
$$0.01 < \tau_A < 0.85, \quad 0.30 < \tau_B < 0.97, \quad 0.46 < \tau_C < 0.99.$$

The values of $w$ are $w_A = 0.512$, $w_B = 0.810$, $w_C = 0.954$.

The greatest difference is 0.442, and the upper limit to its standard error is $\sqrt{(4/n)} = 0.365$, so on these grounds the difference between A and C would not be judged significant.

The 5 % limits are very wide, and the lack of significance is disappointing since C was known to be an expert appraiser while A is relatively inexperienced, and one would have expected an obvious difference between them.
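The figures in (i) for the transformed values can be reproduced as follows. This is only a sketch of the arithmetic, taking the three correlations as quoted in the text and the upper limit $\sqrt{(4/n)}$ for the standard error of a difference of two values of $w$; the variable names are illustrative.

```python
import math

n = 30
t = {"A": 0.490, "B": 0.724, "C": 0.816}

# Transformed values w = arcsin(t), in radians.
w = {name: math.asin(value) for name, value in t.items()}

# Greatest difference, and the upper limit to its standard error.
greatest_difference = w["C"] - w["A"]
se_upper_limit = math.sqrt(4.0 / n)

if __name__ == "__main__":
    print({name: round(value, 3) for name, value in w.items()})
    print(round(greatest_difference, 3), round(se_upper_limit, 3))
```

The ratio of the difference to the upper limit of its standard error is about 1.2, which is why the difference between A and C is not judged significant on these grounds.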
(ii) The variances estimated from the unbiased formula (16·1) are
$$\operatorname{var} t_A = 0.006630, \quad \operatorname{var} t_B = 0.005067, \quad \operatorname{var} t_C = 0.002198.$$

The estimated standard errors are therefore
$$s_A = 0.081, \quad s_B = 0.071, \quad s_C = 0.047.$$

The 5 % confidence limits, assuming normality, are
$$0.33 < \tau_A < 0.65, \quad 0.58 < \tau_B < 0.86, \quad 0.72 < \tau_C < 0.91.$$

Table 2

c_ij (j = 1, ..., 30)                                                  c_i

0 - + - +  - + + + +  + + + + -  + + + + +  + + + + +  + + + + +     21
- 0 + - +  - + + + +  + + + + -  + + + + +  + + + + +  + + + + +     21
+ + 0 - -  - + + - +  + + + + -  + + + + +  + + + + +  + - + + +     17
- - - 0 +  - + + + +  + + + + -  + + + + +  + + + + +  + + + + +     19
+ + - + 0  - + + + +  + + + + -  + + + + +  + + + + +  + + + + +     23
- - - - -  0 + + + +  + + + + -  + + + + +  + + + + +  + + + + +     17
+ + + + +  + 0 + - -  + - + - -  - - + + +  + + + + +  + - + + +     13
+ + + + +  + + 0 - -  - - + - -  - - + + -  + + + + +  + - + + +      9
+ + - + +  + - - 0 +  + + + + -  + + + + +  + + + + +  + - + + +     19
+ + + + +  + - - + 0  + + + + -  + - + + +  + + + + +  + - + + +     19
+ + + + +  + + - + +  0 - + - -  - - + + -  + + + + +  + - + + +     13
+ + + + +  + - - + +  - 0 + + -  - - + + +  + + + + +  + - + + +     15
+ + + + +  + + + + +  + + 0 - -  - - + - -  - + + - -  - - + + +      7
+ + + + +  + - - + +  - + - 0 -  - - + + +  + + + + +  + - + + +     13
- - - - -  - - - - -  - - - - 0  + + + + +  + + + + +  + + + + +      1
+ + + + +  + - - + +  - - - - +  0 - + + +  + + + + +  + - + + +     13
+ + + + +  + - - + -  - - - - +  - 0 + + +  + + + + +  + - + + +     11
+ + + + +  + + + + +  + + + + +  + + 0 - -  - - - - -  - - - - -      5
+ + + + +  + + + + +  + + - + +  + + - 0 -  - + + - +  - - + + +     15
+ + + + +  + + - + +  - + - + +  + + - - 0  + + + + +  + - + + +     17
+ + + + +  + + + + +  + + - + +  + + - - +  0 + + - +  + - + + +     19
+ + + + +  + + + + +  + + + + +  + + - + +  + 0 - - -  - - - - -     11
+ + + + +  + + + + +  + + + + +  + + - + +  + - 0 - -  - - - - -     11
+ + + + +  + + + + +  + + - + +  + + - - +  - - - 0 +  + - + + +     15
+ + + + +  + + + + +  + + - + +  + + - + +  + - - + 0  - - + + +     17
+ + + + +  + + + + +  + + - + +  + + - - +  + - - + -  0 - + + +     15
+ + - + +  + - - - -  - - - - +  - - - - -  - - - - -  - 0 + + +    -11
+ + + + +  + + + + +  + + + + +  + + - + +  + - - + +  + + 0 + -     21
+ + + + +  + + + + +  + + + + +  + + - + +  + - - + +  + + + 0 -     21
+ + + + +  + + + + +  + + + + +  + + - + +  + - - + +  + + - - 0     19

$\sum c_i = 426$, $n = 30$, $\sum c_i^2 = 7470$



Moreover, we should judge A and C to be significantly different at the 1 % level, and A and B at the 5 % level. How far these conclusions are valid depends, of course, on the accuracy of the variance estimates, but the conclusions seem to agree with what might have been expected from prior knowledge of the assessors' capabilities.

(iii) The values of $\gamma_1$ calculated from (16·2) and (16·3) are
$$\gamma_1(A) = -0.32, \quad \gamma_1(B) = -0.36, \quad \gamma_1(C) = -0.38.$$

The distributions would not appear to be very skew, and the distribution of the difference of two $t$'s is probably nearly normal. The adjusted 5 % limits are, from (15·1),
$$0.32 < \tau_A < 0.64, \quad 0.57 < \tau_B < 0.85, \quad 0.72 < \tau_C < 0.90.$$
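The normal-theory and adjusted limits of (ii) and (iii) follow from the quoted values of $t$, $s$ and $\gamma_1$. The sketch below assumes the limits are formed as $t + s\{\pm\xi + \gamma_1(\xi^2 - 1)/6\}$ with $\xi = 1.96$, which is one reading of (15·1) that reproduces the printed figures; the function name is hypothetical.

```python
def confidence_limits(t, s, gamma1=0.0, xi=1.96):
    """Lower and upper limits t + s * (-/+ xi + gamma1 * (xi**2 - 1) / 6).

    With gamma1 = 0 these are the ordinary normal-theory limits.
    """
    shift = gamma1 * (xi * xi - 1.0) / 6.0
    return t + s * (-xi + shift), t + s * (xi + shift)


# (t, estimated standard error, gamma_1) for the three assessors, as quoted in the text.
assessors = {
    "A": (0.490, 0.081, -0.32),
    "B": (0.724, 0.071, -0.36),
    "C": (0.816, 0.047, -0.38),
}

if __name__ == "__main__":
    for name, (t, s, g) in assessors.items():
        normal = confidence_limits(t, s)
        adjusted = confidence_limits(t, s, g)
        print(name,
              [round(v, 2) for v in normal],
              [round(v, 2) for v in adjusted])
```

The adjustment moves both limits downwards by about 0.01, reflecting the negative skewness.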
APPENDIX

1. The question arises whether a particular parental form exists for which the variance of $t$ assumes the upper limit $2(1 - \tau^2)/n$. We surmise, though we cannot prove, that the maximum possible variance is attained when the parent ranking has a 'canonical' form obtained in the following way. Consider again the ranking

5 2 3 1 6 7 8 9 4.

The number of positive pairs $S$ is 26, so that $\tau = 0.44$. Let us transform this so as to bring the 1 to the beginning of the ranking but move the 9 so as to preserve the number $S$ at 26. The 1 passes over three members to go to the beginning and hence adds 3 to the score. The 9 must, therefore, proceed to the left over three numbers so as to subtract 3 from the score and we reach

1 5 2 3 9 6 7 8 4.

Now operate similarly with 2 and 9, reaching

1 2 5 9 3 6 7 8 4.

Had our 9 been contiguous to the 1 and incapable of moving farther to the left we should have moved the 8 and so on. Proceeding with the process by moving back the 3 and the 9 and 8 we reach

1 2 3 9 5 6 8 7 4,

and again

1 2 3 4 9 8 7 6 5.

All the lower numbers 1 to 4 are in the right order and the remainder are in the inverse order. We call this ranking the 'canonical' order for given $S$ (or $\tau$). It is not always possible to reduce a given ranking to canonical order, but there cannot be more than one individual out of place.

2. Consider the effect of a series of transformations leading to the canonical form. The first process, that of moving 1 and 9, will increase the value of $S$ for some samples involving 1 but not 9 (leaving the others unchanged), will decrease the value of $S$ for some samples involving 9 but not 1 (leaving the others unchanged), and will, in general, not alter those involving both 1 and 9. Similarly for 2 and 8, and so on. The effect of the transformation is thus to increase the values of $S$ containing the lower numbers 1, 2, 3, etc., and to decrease those containing 9, 8, 7, etc. These values of $S$ are themselves, in the canonical form, the greatest or least as the case may be. Consequently the progress to the canonical form is accompanied by increases in the number of high values of $S$ and increases in the number of lower values, and one might expect the spread of the distribution to tend to a maximum. In the example quoted, the distributions of $S$ in samples of 3 for the successive rankings are:

Values of S        Frequencies f
    0            2     3     3     6    10
    1           15    13    16    10     0
    2           34    35    29    32    40
    3           33    33    36    36    34
  Totals        84    84    84    84    84

The sums $\sum fS$ are all equal to 182. The sums $\sum fS^2$ are respectively 448, 450, 456, 462 and 466, showing the canonical ranking to have the largest variance of the five.
3. There is, however, another way of carrying out this process. If the parent ranking is inverted, $\tau$ becomes $-\tau$, but the variance of samples of $n$ drawn from the inverted ranking remains the same, by symmetry. We may then reduce the inverted ranking to its canonical form and reinvert it so that its coefficient is again $\tau$. This ranking we call the inverse canonical form. It will be shown that for large $N$, when $\tau > 0$ the inverse canonical form yields a larger variance for $t$ than the direct canonical form.

Even in the example already quoted, the inverse canonical ranking (with one member out of place) is

3 4 2 5 6 7 8 9 1,

which has a distribution

Values of S      f
    0            2
    1           27
    2           10
    3           45
  Total         84

The sum $\sum fS^2$ is now 472, which is greater than the previous maximum 466.
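The frequencies quoted in §§ 2 and 3 can be verified by direct enumeration of the 84 samples of 3. The sketch below counts the positive pairs $S$ in every sample drawn from each of the rankings discussed above; the function names are illustrative only.

```python
from collections import Counter
from itertools import combinations

# Successive rankings of section 1, ending in the canonical form,
# followed by the inverse canonical ranking of section 3.
rankings = [
    [5, 2, 3, 1, 6, 7, 8, 9, 4],
    [1, 5, 2, 3, 9, 6, 7, 8, 4],
    [1, 2, 5, 9, 3, 6, 7, 8, 4],
    [1, 2, 3, 9, 5, 6, 8, 7, 4],
    [1, 2, 3, 4, 9, 8, 7, 6, 5],
]
inverse_canonical = [3, 4, 2, 5, 6, 7, 8, 9, 1]


def positive_pairs(sample):
    """Number of pairs in the sample standing in the natural order."""
    return sum(1 for a, b in combinations(sample, 2) if a < b)


def distribution(ranking, size=3):
    """Frequency of each value of S over all samples of the given size.

    combinations() keeps the members of each sample in their ranking order.
    """
    return Counter(positive_pairs(sample) for sample in combinations(ranking, size))


def sum_fs(ranking):
    return sum(f * s for s, f in distribution(ranking).items())


def sum_fs2(ranking):
    return sum(f * s * s for s, f in distribution(ranking).items())


if __name__ == "__main__":
    for r in rankings + [inverse_canonical]:
        d = distribution(r)
        print(r, [d[s] for s in range(4)], sum_fs(r), sum_fs2(r))
```

Since $\sum fS$ is the same for every ranking, a larger $\sum fS^2$ means a larger variance of $S$ in samples of 3.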

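4. Consider the canonical case when there are $N$ members altogether, $R$ at the beginning in the right order, and $N - R$ in the inverse order. If we select $n - j$ members from the $R$ and $j$ from the $N - R$ the value of $S$ for the sample of $n$ is $\tfrac12 n(n-1) - \tfrac12 j(j-1)$, and the relative frequency of $U = \tfrac12 n(n-1) - S = \tfrac12 j(j-1)$ is
$$\binom{R}{n-j}\binom{N-R}{j}\Big/\binom{N}{n}.$$
Suppose that $N$ tends to infinity and $R/N$ to the ratio $p$. The relative frequency of $U = \tfrac12 j(j-1)$ tends in the limit to
$$\binom{n}{j}p^{n-j}q^{j},$$
where $q = 1 - p$. The mean value of $U$ is then
$$\sum_j \tfrac12 j(j-1)\binom{n}{j}p^{n-j}q^{j} = \tfrac12 n(n-1)q^2,$$
and since $\tau = 1 - 4E(U)/n(n-1)$ we must have
$$\tau = 1 - 2q^2. \qquad (4\cdot1\mathrm{A})$$
The variance of $U$ is
$$\operatorname{var} U = n(n-1)\,pq^2\{nq + \tfrac12(1 - 3q)\}, \qquad (4\cdot2\mathrm{A})$$
and so
$$\operatorname{var} t = 16pq^2\{nq + \tfrac12(1 - 3q)\}/n(n-1). \qquad (4\cdot3\mathrm{A})$$

5. If now the inverted parent ranking is reduced to canonical form, giving ratios $p'$, $q'$ corresponding to $p$ and $q$, we shall have
$$q' = \sqrt{[\tfrac12(1 + \tau)]} \qquad (5\cdot1\mathrm{A})$$
and
$$\operatorname{var} t' = 16p'q'^2\{nq' + \tfrac12(1 - 3q')\}/n(n-1). \qquad (5\cdot2\mathrm{A})$$
Then since $q^2 + q'^2 = 1$,
$$\operatorname{var} t' - \operatorname{var} t = \frac{16(n-2)}{n(n-1)}(q' - q)(1 - q)(1 - q'). \qquad (5\cdot3\mathrm{A})$$

When $\tau$ is positive, $q' > q$ and var $t'$ exceeds var $t$.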
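The limiting formulae of §§ 4 and 5 lend themselves to a numerical check. The sketch below takes $j$ to be binomially distributed with index $n$ and probability $q$, computes the variance of $U = \tfrac12 j(j-1)$ by direct summation, and compares it with the closed form $n(n-1)pq^2\{nq + \tfrac12(1-3q)\}$; it also checks the difference formula $16(n-2)(q'-q)(1-q)(1-q')/n(n-1)$. Both closed forms are written here as assumptions about the intended expressions, and the function names are hypothetical.

```python
from math import comb, sqrt


def var_u_exact(n, q):
    """Variance of U = j(j-1)/2 when j is binomial(n, q), by direct summation."""
    p = 1.0 - q
    pmf = [comb(n, j) * p ** (n - j) * q ** j for j in range(n + 1)]
    u = [j * (j - 1) / 2.0 for j in range(n + 1)]
    mean = sum(f * x for f, x in zip(pmf, u))
    return sum(f * (x - mean) ** 2 for f, x in zip(pmf, u))


def var_u_formula(n, q):
    """Closed form n(n-1) p q^2 {nq + (1 - 3q)/2}."""
    return n * (n - 1) * (1.0 - q) * q * q * (n * q + 0.5 * (1.0 - 3.0 * q))


def var_t(n, q):
    """Variance of t = 1 - 4U / n(n-1)."""
    return 16.0 * var_u_formula(n, q) / (n * (n - 1)) ** 2


def var_gap(n, tau):
    """var t' - var t for the inverse and direct canonical forms."""
    q = sqrt(0.5 * (1.0 - tau))
    q_inv = sqrt(0.5 * (1.0 + tau))
    return 16.0 * (n - 2) * (q_inv - q) * (1.0 - q) * (1.0 - q_inv) / (n * (n - 1))


if __name__ == "__main__":
    n, tau = 10, 0.5
    q, q_inv = sqrt(0.5 * (1 - tau)), sqrt(0.5 * (1 + tau))
    print(var_u_exact(n, q), var_u_formula(n, q))
    print(var_t(n, q_inv) - var_t(n, q), var_gap(n, tau))
```

For $n = 2$ the closed form reduces to $q^2(1 - q^2)$, as it should, since $U$ is then 1 with probability $q^2$ and 0 otherwise.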
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TESTING FOR NORMALITY 

By R. 0. GEARY, Cambridge, University Department of Applied Economics 

1. Introduction 

The present communication, one of a series, has two main objectives: 

(1) To show that probabilities derived from the well-known analyses of variance and 
other ‘ small sample ’ tables, which postulate universal normality, may differ seriously from 
the true probabilities when the universes are non-normal, even, in some cases, when the 
degree of non-normality is not considerable. 

(2) To determine the most efficient tests of normality from a wide field of alternative 
symmetrical tests. 

It may be useful to summarize very briefly previous work in so far as it is strictly relevant to this study.* The modern theory may be regarded as having been initiated by Karl Pearson who, in 1895, found the first approximation (i.e. to $n^{-1}$) to the variances and covariance of $\sqrt{b_1}$ and $b_2$ for samples drawn at random from any universe and, assuming that the $\sqrt{b_1}$ and $b_2$ were distributed jointly with normal probability, constructed 'probability ellipses' from which the probability of the same values occurring, had the universe, in fact, been normal, could be inferred very approximately. A considerable advance in moment determination was made by C. C. Craig (1928). In 1929, R. A. Fisher, in inventing cumulants, simple functions of the sample moments, and formulating rules for finding their semi-invariants, developed incidentally a technique for expanding to several terms in $1/n$ the moments of $\sqrt{b_1}$ and $b_2$ when the universe was normal. This paper was followed soon after by another (1930), fundamental for all succeeding work on this subject, in which R. A. Fisher ingeniously applied combinatorial technique to the finding of exact values of the moments of normal $\sqrt{b_1}$ and $b_2$, and gave inter alia the values of the second, fourth and sixth moments of $\sqrt{b_1}$ and of the first three moments of $b_2$. The fourth semi-invariant of $b_2$, together with many other normal semi-invariants, was determined by J. Wishart in 1930, and a further advance in R. A. Fisher's technique was made jointly by R. A. Fisher & J. Wishart in 1930. In 1932 Joseph Pepper gave the eighth normal moment of $\sqrt{b_1}$. Using R. A. Fisher's rules C. T. Hsu and D. N. Lawley in 1940 gave the exact values for normal random samples of the fifth and sixth moments of $b_2$. Using a method due to R. C. Geary (1933) (applying C. C. Craig's ideas (1928) to the normal problem), R. C. Geary & J. P. G. Worlledge have recently (1946) found the seventh moment of $b_2$.

So much for moment determination. In 1930, E. S. Pearson used appropriate Pearson-type curves, applied to R. A. Fisher's (1929) approximations of the semi-invariants, to find approximate frequency distributions of $\sqrt{b_1}$ and $b_2$. From the frequency distributions he computed a table of 1 % and 5 % probability points at intervals for $n$ from 50 to 5000 for $\sqrt{b_1}$ and for $n$ from 100 to 5000 for $b_2$.

Since at the time the prospect seemed remote of determining the frequency of normal $b_2$ on which reliance could be reposed for samples of moderate sizes, R. C. Geary (1935)† suggested that the ratio, $a$, of mean deviation to standard deviation computed from the origin

* An excellent account of the development of moment theory up to the year 1930 was given by J. Wishart (1930).

† The author was informed by M. Fréchet that this test was suggested by Bertrand, but has been unable to check the reference.




might be used as a test of normality, and gave the 1 and 5 % probability points for this test at intervals for normal samples of 6-100. E. S. Pearson compared experimentally Geary's test with $b_2$ and suggested, for samples so large that comparison could safely be made, that $b_2$ was probably somewhat more sensitive than $a$, a suggestion which will be examined theoretically in this communication. In 1936 also, R. C. Geary showed that there was a high (negative) correlation between $a(1)$ (see 3·1) and $b_2$ for normal samples, and argued therefrom that the former should be nearly as efficient as $b_2$. In 1936, R. C. Geary gave a table of 1, 5 and 10 % probability points of $a(1)$ at intervals for samples of 11-1001. In 1938, a brochure by R. C. Geary & E. S. Pearson was published by the Biometrika Office entitled Tests of Normality, giving tables and diagrams of probability points of $a(1)$, $\sqrt{b_1}$ and $b_2$. There is considerable literature dealing with the effect of universal non-normality on the normal tests, mostly by way of particular numerical examples: a selection of papers on this subject is included in the list of references at the end of the paper.

2. Effect of non-normality

(a) The z-test

The effect of universal non-normality will first be considered in relation to the z-test. If $x_1, x_2, \ldots, x_{n'}$ and $y_1, y_2, \ldots, y_{n''}$ are independent samples drawn at random from the same universe (normal or non-normal) it is easy to show that, if
$$z = \tfrac12 \log_e\left\{\frac{1}{n'-1}\sum_{i=1}^{n'}(x_i - \bar{x})^2 \Big/ \frac{1}{n''-1}\sum_{i=1}^{n''}(y_i - \bar{y})^2\right\}, \qquad (2\cdot1)$$
then
$$\sigma_z^2 = \tfrac14(\beta_2 - 1)\left(\frac{1}{n'} + \frac{1}{n''}\right), \qquad (2\cdot2)$$
when both $n'$ and $n''$ are so large that terms in $n'$ and $n''$ of degree less than $-1$ are regarded as negligible. This is an obvious generalization of the approximate formula given by R. A. Fisher* for normal samples, namely,
$$\sigma_z^2 = \tfrac12\left(\frac{1}{n'} + \frac{1}{n''}\right). \qquad (2\cdot3)$$

It may be useful also to give formulae for the first and second moments from zero for $z$ when the two random samples are drawn not necessarily from the same universes, though both universes have mean zero and the same variance $\lambda_2$:

* Statistical Methods for Research Workers, 8th ed. p. 219.
where the $\lambda$'s indicate semi-invariants of the two universes of the orders indicated. In these formulae, in effect, terms to order $-2$ in $n'$, $n''$ are retained.

When both samples are large the frequency distribution of $z$ will approach normality provided that $\beta_2$ is finite. The effect of universal kurtosis can accordingly be assessed in a very rudimentary manner from (2·2) and (2·3). The z-deviate $\xi$ corresponding to, say, the 2½ % normal probability point is $1.9600\sqrt{M_2}$, where $M_2$ is the normal-theory variance of $z$. If, however, the universe were not normal and had, in fact, a variance $M_2'$ with $\beta_2 \neq 3$, the actual probability of a deviation in excess of $\xi$ in absolute value would be, not 0.05, but the normal probability appropriate to a unit variance deviate of $1.9600\sqrt{(M_2/M_2')}$. On this consideration the actual probabilities for different values of $\beta_2$, where the assumed probability is 0.05, are shown in the fifth column of Table 1.


Table 1. Effect on probability of z of change in universal kurtosis, for large samples

 $\beta_2$   $M_2/M_2'$   $\sqrt{(M_2/M_2')}$   $1.9600\sqrt{(M_2/M_2')}$   Actual probability
   1.5        4              2                    3.9200
   2          2              1.4142               2.7718
   2.5        1.3333         1.1547               2.2632                   0.024
   3          1              1                    1.9600                   0.050
   3.5        0.8000         0.8944               1.7530
   4          0.6667         0.8165               1.6003                   0.110
   4.5        0.5714         0.7559               1.4816                   0.138
   5          0.5000         0.7071               1.3859                   0.166
   5.5        0.4444         0.6667               1.3066                   0.191
   6          0.4000         0.6325               1.2397                   0.215


The table shows that, if the universe from which the samples are drawn has $\beta_2 = 5$ the true probability is about 1 in 6 instead of the assumed 1 in 20. It is, of course, true that universes with so large a kurtosis are unusual. This view cannot be held of the range 2.5-4 for $\beta_2$ in which the probability, assumed to be 0.05, can be anything, in fact, from 0.024 to 0.110. Accordingly, if universal kurtosis is markedly negative, use of the standard table masks significant differences; if kurtosis is positive the standard table exaggerates these differences. Unless systematic tests have established that kurtosis is negligible the standard table should not be used for testing significant differences in variance.
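The columns of Table 1 can be recomputed from $\beta_2$ alone. The sketch below assumes that the ratio of the normal-theory variance of $z$ to its actual variance is $2/(\beta_2 - 1)$, which follows on dividing $\tfrac12(1/n' + 1/n'')$ by $\tfrac14(\beta_2 - 1)(1/n' + 1/n'')$, and evaluates the two-tailed normal probability with the complementary error function. The function names are illustrative.

```python
import math


def variance_ratio(beta2):
    """Normal-theory variance of z divided by its actual variance, for large samples."""
    return 2.0 / (beta2 - 1.0)


def actual_probability(beta2, deviate=1.96):
    """Two-tailed normal probability of exceeding deviate * sqrt(variance ratio)."""
    x = deviate * math.sqrt(variance_ratio(beta2))
    return math.erfc(x / math.sqrt(2.0))


if __name__ == "__main__":
    for beta2 in (1.5, 2, 2.5, 3, 3.5, 4, 4.5, 5, 5.5, 6):
        ratio = variance_ratio(beta2)
        print(beta2, round(ratio, 4), round(math.sqrt(ratio), 4),
              round(1.96 * math.sqrt(ratio), 4),
              round(actual_probability(beta2), 3))
```

At $\beta_2 = 3$ the function returns the assumed probability 0.05, as it must.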

The foregoing analysis gives a theoretical explanation of the striking experimental results of E. S. Pearson (1931b) working, however, with a test function

i=l / U“1 I 

and with sample sizes $n' = 5$ and $n'' = 20$, smaller than those contemplated in the present analysis. With 500 samples Pearson showed that when the frequency at the two tails together expected from normal theory was 15.4 (= probability 0.0308) the frequencies actually found in symmetrical universes with $\beta_2 = 2.5$, 4.1 and 7.1 respectively were 7, 39 and 47, equivalent to probabilities of 0.014, 0.078 and 0.094.









If tests of normality indicate universal kurtosis, either of two courses might be adopted:

(i) Assume that $z$ is normally distributed with variance computed from (2·2) with $\beta_2 - 3$ estimated as $k_4/k_2^2$ from the sample, $k_2$ and $k_4$ being R. A. Fisher's (1929) cumulant functions.

(ii) Enter the standard table, not with $z$ computed from the samples but with $z\sqrt{(M_2/M_2')}$, estimating $\beta_2$ as in (i).

Both of these procedures are, of course, open to the objection that, unless the samples are extremely large the estimate of $\beta_2$ is unlikely to be accurate; the real $\beta_2$ might be larger or smaller than the estimate. Any probabilistic inferences should accordingly be accepted with reserve.

It is fortunate that the condition specified in the foregoing paragraphs, namely, that the numbers in the two samples are both large, rarely applies in practical applications. It more usually happens that the number of classes is small, whereas the number per class is relatively large. In this case E. S. Pearson (1931b) has shown the first approximation to $\sigma_z^2$ is independent of $\beta_2$, from which he inferred that the actual probability when the total number of samples was large was inconsiderably influenced by kurtosis. In view of the foregoing analysis it seemed to the writer desirable to carry the inquiry a stage further.

Suppose, then, that $k$ samples are drawn at random from the same universe, $n_j$ in the $j$th sample, the total $\sum_j n_j = n$. It is assumed that $n$ is so large that terms in $n^{-2}$ are negligible, that the number of samples $k$ is small, and that all the $n_j$ are of the same order of magnitude as $n$, i.e. that if
$$n_j = \pi_j n, \qquad \sum_j \pi_j = 1, \qquad (2\cdot6)$$
none of the $\pi_j$ is negligibly small.

Using R. A. Fisher's cumulant notation with subscript to indicate the sample from which the cumulants were computed, the mean for the $j$th sample is written $k_{1j}$ and its variance $k_{2j}$. Then
$$z = \tfrac12 \log w, \qquad (2\cdot7)$$


where 

(fc- i)Z = 

j 

so that 

^ = i:7T^(l-nj)^,-2'ZI.n^7ryk,,k,r, 

and 

{n-k)Y=^{n^-l)k,^, 

so that 

II 

where 

, n,- -1 


Without loss of generality let the universal mean be zero and the variance unity. It may 
easily be shown that 

Jill A. = JJj X = 1. 


X X + IX-X) , 

- Y" y {1 + (-X" l)}{i + (?- 1)} 

w2 = {l + (Z-l)}2{l-2(r-l) + 3(y-l)2~4(7-l)3 + ...}.| 


Then 


( 2 - 8 ) 
We shall compute the approximate values of $Ew$ and $Ew^2$, i.e. the values to order $n^{-1}$; the symbol $\doteq$ denotes 'equal to, to approximation required'. From values of the variances and covariances given by E. S. Pearson (1931b) in his equations (9)-(11), we have


jS?(X-l)(r-l):C=^, 

71 

2 ?(F_l) 2:^.^i±!^ 


(2-9) 


with 


“o = S 
j 


We require |-■“j — 

+ 2 S S TTy + STT^.TT^,) h% lc\y - 4 S S S ^iTTynyil - 37r^) h%h^y k,y., 

33 j j' j" 

say, 


remembering that, by definition of oumulants, 


Ek^f = Ag = 1. 

Also (7- 1 )2 = S+ 2 S S <l>i'<l>yKiKy. 

j>3' 

It will be useful for what follows to note that 


Using R. A. Fisher’s formulae (1929) for formation of joint semi-invariants of and kj,, 
and noting that the h samples are independent, we find from the foregoing 


n{k-l)EX{Y^l)^^{k~l){X^^2), ' 
n{k - 1 )2 EX^{ 

^fc-l)2 1?Z2(7^1)2:^(jfc^^l)(A4 + 2)J 

Then, from (2*8), (2-9), (2d0), 

2 i 

Ew^l + -, 
n 

it” 4^ 1 1 


( 2 - 10 ) 


( 2 - 11 ) 


These are the formulae required. It will be noted 

(i) that the terms free of n~^ are independent of A4, which is equivalent to B. S. Pearson’s 
result (19316); 

(ii) that the formulae (2-11) agree with the normal values 



to n~'^ when A4 = 0; 

(iii) the approximations at (2-11) are free of A3. 



(2-J2) 
The approximations at (2·11) tend to confirm E. S. Pearson's result that, when $n$ is large compared with $k$, the effect of universal kurtosis is unimportant. It would be useful, however, to compute the approximate true probability for different values of $k$, $\lambda_4$ and $\alpha_{-1}$. For this and for subsequent work the following lemma* will be found useful.


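If $f(x)$ and $\phi(x)$ are two frequency densities with semi-invariants $L_m$ and $L'_m$ ($m = 1, 2, \ldots$), respectively, then, formally,
$$f(x) = \exp\left\{\sum_m (L_m - L'_m)\frac{(-D)^m}{m!}\right\}\phi(x), \qquad D = \frac{d}{dx}. \qquad (2\cdot13)$$

For the present application take as generating function $\phi$ the frequency distribution of $w$ in the normal case, i.e.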


0(M)) = 


lh-Z\ln-k-2\ 




(h—l) 
{n~k) 


(2-14) 

(2-15) 


j r /mix r r/ (P + 2 A: - 2 - a_i) A 4 

and, from ( 2 - 11 ), - n{k-l Y -’ 

Assume that (m4=2). 

Then if the ‘normal theory’ probability corresponding to the sample value w be p, the 
approximate ‘true’ probability, subject to (2-15), will be about [p +p'), where 35 ' is given by 

iW-m 


p 


- dw = — 

J W 




(2-16) 


The term $p'$, of course, merely corrects for the non-normal term in $n^{-1}$ in the variance of $z$; it takes no account of corrections due to terms of higher (negative) orders in $n$ or even of non-normal terms in $n^{-1}$ in semi-invariants $L_m$ ($m > 2$). The calculation is designed merely to show whether the standard table probability requires correction for universal kurtosis; this will appear if $p'$ is of the order of magnitude of $p$.


(b) The t-test 

In Geary’s 1936 paper the expansion to terms in n'~^ of the first four moments of t, where 

(2-17) 

were given. Following are the first six semi-invariants Z of ^ to the same approximation as 
in the earlier paper: 


Z,= 


1 

n^ 




2X54-5X3X4) I -h . 


£ 2=^1 + i(8 + 7Ai)n-H (6- 2A4-fAi-4iA3A5+J3^A|A4) 

— 2A371~* — ( 9 A 3 — 3A5 + ’^A 3 A 4 + -^A 3 )? 1 ”^, 

1 ^ 4 ^ (6 - 2 A 4 +12Ai) + (64 - 18 A 4 + 4A6 + 76Ai - 63 A 3 A 5 - 6 Ai + 81 Ai A 4 + r>r\ 


(2-18) 


-^ 5 — - ( 6 OA 3 - 6 Ag — 2 OA 3 A 4 + IO 5 A 3 ) 

I/6=(240- I 2 OA 4 + 677^A| + I 6 A 3 - 21 OA 3 A 5 - I 5 OAIA 4 + 1200At) 
Throughout this subsection we take x _ v n'im 


* Due to Charlier and termed the “Differential Series” by the Scandinavian School, 
t 1936 formula corrected. 
where the $\lambda_i$ are the semi-invariants of the parent universe. For these expressions terms of higher order in $n^{-1/2}$ are neglected. They were derived from the moments (from zero) of $t$ which were obtained by the method described in the 1936 paper. It will be noted that, to the approximation used, the expressions involve only the first six semi-invariants of the parent universe. When the parent universe is normal all the $\lambda_i$ ($i > 2$) are zero. The magnitude of the numerical coefficients in the foregoing approximate expressions for the $L_i$ indicate that, when the universal values of the $\lambda_i$, particularly those of uneven order, are not very small, the frequency distribution of $t$ may differ appreciably from the classical Gosset-Fisher (1908, 1926) distribution.

The formal Gram-Charlier expression for the frequency of $t$ could, of course, be written down at once from (2·18). It is doubtful, however, if the Gaussian can be regarded as the most appropriate generating function for the frequency of $t$ because, even when the parent universe is normal, the semi-invariants $L_{2m}$ of higher even orders are large for moderate values of $n$. For example,
$$L_4/L_2^2 = 6/(n-5), \qquad L_6/L_2^3 = 240/\{(n-5)(n-7)\}.$$

It is proposed to use (2·13) for finding the approximate frequency with

m - T(l, n) = ! (l +~)“7 

the Gosset-Fisher frequency. Let 

/ \-l« 

It can easily be shown that the rth derivative (in t) of is 


(2-19) 

( 2 - 20 ) 




— 1 )! 






2.4 


(n — 1 )! (ti— !)■ 

r(r-l)(r-2)(j--3)(r-4)(r-5) W \ 

^3 t 


“Kw+ 2 r) 


( 2 - 21 ) 


with 


n, 


u— 1 
nTl' 


Tip = 


(tI“ 1)2 


(?i+l)(n-H3)’ 


Tlo = 


(?l~“ 1)2 


(?i + 1) (ti 4- 3) (ti 4 - 6) 


etc. 


Note that (2·21) assumes the Hermite form when $n = \infty$.

The theory will now be applied to particular examples using in all cases $n = 10$. The universes will be assumed to belong to the Karl Pearson system, so that (M. G. Kendall, 1941) the values of $\lambda_5$ and $\lambda_6$ can be derived (given $\lambda_3$ and $\lambda_4$) from the following equations:


(l-t-49/)A3-h2g 

= 0 ; 

{l + 57j)A^+3^As + ^V 

= 0, 

(1 -f Qi^) Ag -f 4^A4 -|- 249 ^X 3 

= 0, ■ 

(1 -h 79?) Ag + SgAg -t- 109?(4A4 -h 3Ai) 

= 0. 


From the first two equations 

Tj == (2A4-3Ai)/(-10A4+12Al-12), 


which, suhstitiited in the first equation of ( 2 - 22 ), gives The values of ^ and substituted 
in the thirdandfourth equations, give Ag and Ag. From (2-18), the being the semi-invariants 

Bioraetrika 34 ^5 
when the parent universe is normal (i.e. the values found when all the $\lambda$'s are set equal to
zero), L^~L[^J^n-^ + K^n-i, 

Xg —j&j —^5 (2'23) 

The J and K are the terms in the A in (2-18). To n~^ (i.e. ignoring «.-*) the frequency generated 
from T of (2-19) is as follows: 

fit) = + (.4 + ^i)+|f'(J4 + 4J,/3) + ^Vi| 

( D3 D® 

+ + + lOJg Jg “f 10 Jf/g) 

+ ^^iJA + 2Ji Jl) + J)®) + (Y (^2 + 2JiZ,)+g (X, + 4J,X, 

+ 4 j 3 j!f]L+^^l'^ + (^ 6 ^“ 15/2^4 

+ 60e7j^ J 3 + 15«/|J4 + 20e7JJ^)+jY“g^ (16t/g/fg+ 10t7| H-* 80t72t73 

7110 t 4 \ 

+ 80JiJ3J,+ 80J|Ji) + ^(3Ji/,+ 4/,Ji) + ^^ (2^24) 


To (2*24) agrees with the formula given by M. S. Bartlett (1935), in which, however, 
there is a small and obvious slip in a sign. The law of formation of the numerical coefficients 
of (2-24) is evident; for instance, the numerical coefficient of DU^Jl is 1/144'*=: 1/2! 31^2!. 

The integrals f ^and f (^ > 0) are found by reducing the exponent of JDbyunity, as follows: 
Jt J —00 

Ddt = - T, = f * D^^dt = D 2 m-i, = - f"* D^”^+^dt = 

J —00 J I J—flO J i J—OD 

(2-25) 

In normal theory the upper and lower 2½ % points of $t$ are $\pm 2.262$ for $n = 10$. Table 2 shows the 'true' probabilities, i.e. the value of
$$\int_{-\infty}^{-2.262} f(t)\,dt \qquad (2\cdot26)$$
for parent universes specified by $\lambda_3$, $\lambda_4$, using (2·24).

There are two observations to be made on the results presented in this table. The first is that, despite the considerable number of terms (shown at (2·24)) included in the probability expansion, the values found in the successive terms cannot be regarded as satisfactorily convergent for so small a sample as 10, and, of course, the convergence disimproves with increasing non-normality of the universe. Taken all together, however, they seem consistent and significant. The second observation is that attention was confined to the negative 'tail' of the distribution. It may be assumed that, in all cases, the distortion would be very considerably less marked if regard were had to the probability for $|t| > 2.262$. Actually for universe 3 the probability is 0.066, not significantly different from the normal theory probability of 0.05. In justification of the attitude adopted above, the point might be put as follows:

We decide to accept the hypothesis that the universal mean is zero provided that the value of $t$ found from the particular sample satisfies $t_0 < t < t_1$, where
$$\operatorname{Prob}(t < t_0) = \operatorname{Prob}(t > t_1) = 0.025.$$

The table is designed to show that if the parent universe is markedly asymmetrical the range $(t_0, t_1)$ may differ appreciably from $-t_0 = t_1 = 2.262$.

Table 2. Probabilities of t less than −2·262 for samples of 10 for seven universes

Universe    $\lambda_3$    $\lambda_4 = \beta_2 - 3$    Probability
Normal      0              0                            0·025
2           0              1                            0·024
3           1/2            0                            0·041
4           1/√2           1/2                          0·047
5           1              0                            0·072?
6           1              1                            0·086?
7           1/2            1/2                          0·043


As anticipated by earlier work (W. S. Gosset, 1908; R. C. Geary, 1936), the table shows
that the distortion is slight for symmetrical universes; even when $\lambda_4 = 1$ (and $\lambda_3 = 0$) the
probability (0·024) is practically identical with the normal value. There can be little doubt
that the standard table probabilities can be seriously at variance with the true probabilities
when the universes from which the samples are drawn are markedly asymmetrical.

(c) Difference of means

R. A. Fisher's (1926) test of significance

$$t = \frac{(k_1' - k_1'')\sqrt{n' + n'' - 2}}{\sqrt{(n'-1)k_2' + (n''-1)k_2''}}\,\sqrt{\frac{n'n''}{n' + n''}} \tag{2·27}$$

for the difference of averages $k_1'$ and $k_1''$ in normal theory for random samples numbering n'
and n'' is, of course, a particular case of the analysis of variance considered in § (a) above.
The second cumulants are $k_2'$ and $k_2''$. It is assumed that the unknown universal means and
variances are equal. Suppose now that the random samples in reality have been derived
from universes in which the means are equal but the other semi-invariants $\lambda_i'$ and $\lambda_i''$ are not
necessarily zero for i > 2, or even necessarily equal. Since the universal means are assumed
equal, without loss of generality we may take $\lambda_1' = \lambda_1'' = 0$. This general mathematical model
seems to be the correct one; we are not trying to determine the probability of the samples
being derived from the same universe but rather if they could conceivably have been drawn
from universes with the same arithmetic mean, however much they may differ otherwise.
The correctness or otherwise of the concept may be considered in relation to, say, the
problem of deciding from two random samples which of two types of fertilizer is to be preferred
from yield observations on a given crop on a given kind of land. Undoubtedly the
prime problem will be that of ascertaining which is probably the better yielding (i.e. whether
the arithmetic means are significantly different). Of considerably less importance is the
question of which fertilizer is the more variable; of less importance still is the question of
deciding, say, whether with approximately equal yields one universe is symmetrical and
the other markedly asymmetrical. The point is that the question of the equality of universal
means should be considered without assuming that the other semi-invariants in the universes
from which the samples have been drawn are necessarily equal. This essentially is also the
viewpoint in R. A. Fisher's randomization method.
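The test statistic for the difference of means can be sketched as follows, assuming it has the usual pooled-variance form written in terms of the first two k-statistics of each sample. The function name and the sample values are hypothetical and serve only to show that this form coincides with the familiar two-sample t.

```python
import math
from statistics import mean, variance

def fisher_t(sample1, sample2):
    """Two-sample t built from the k-statistics of each sample.

    k1 is the sample mean and k2 the unbiased variance (divisor n - 1).
    """
    n1, n2 = len(sample1), len(sample2)
    k1a, k1b = mean(sample1), mean(sample2)
    k2a, k2b = variance(sample1), variance(sample2)
    pooled = (n1 - 1) * k2a + (n2 - 1) * k2b
    return ((k1a - k1b) * math.sqrt(n1 + n2 - 2) / math.sqrt(pooled)
            * math.sqrt(n1 * n2 / (n1 + n2)))

# Illustrative data only.
t = fisher_t([1.0, 2.0, 3.0, 4.0], [2.0, 4.0, 6.0])
print(t)
```

The statistic depends on the samples only through their means and variances, which is why the higher semi-invariants of the parent universes enter only through the sampling distribution of t.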

Expanding the denominator of (2‘27) in terms of — and — and computing 
therefrom the first few terms of the first four moments of t, we find the following approxima¬ 
tions to the first four semi-invariants: 




2(n'A^+n"A") 


..r .a^.K\(,2n'A'^+n"A'A 

/ /a / 'if \ if 


+ 




K 


K 


n'^ n 


3(A^-A'') 
{n'A2+n 
..r - 6(n'A'2+w"A'^^)/A' , 

^ {n'A '2 + n"Al)^ \n''^n"' 

, 18(A^-A'^)^ (K A''\ _ 

(n'A '2 + n"A[}^ U' n") ^ 


K^-n"^)(A;A'’-AlA') 7(A^-A^)^ 

n'n"{n'A'2 + w"A'')2 4(7i'A' + n"Al)^ 


6 | — 

U'" 

n"^ 


(^3-A-) 


^"ADW W 

^^1 (w'A'-tri"A'^) 

f%2 \ IV / -2/ 


(2*28) 


with 



(A>'-1 + A'>"-1)1 

ll n'n," j 

' {n'+n''-2) 1 


Using formula (2·24) to the term in […] with the Gosset-Fisher function again as generating
function, Table 3 shows rough approximations, for four examples, to the 'true' probability
of values of t ≤ r, where r is the (negative) value for probability 0·025 from the normal table,
and $\lambda_2' = \lambda_2'' = 1$. When the two samples are drawn from different universes the distortion
can accordingly be considerable. The third example suggests that if the universes are the
same the distortion is small, a result to be anticipated from the fact (apparent from (2·28))
that, to the approximation used, the first two semi-invariants are equal to their normal
theory values; this theory confirms the experimental results of E. S. Pearson & N. K. Adyanthaya
(1929).


Table 3 


Example 

n' 

n" 

A,' 

AS 

Ai 

AI 

Probability 

1 

12 

4 

1 

~1 

1 

-1 

0*045 

2 

18 

6 

1 


1 

-1 

0*041 

3 

7 

4 

1/V2 

1/V2 

1/2 

1/2 

0*027 

4 

10 

6 

1 

1 

1 0 

1 

0 

0*036 
It should be remarked that the probabilities in Table 3 (as well as in Table 2) are merely
rough approximations: the samples used are far too small for the results to have any pretension
to accuracy. The object has been merely to show that the actual probability could be
considerably at variance with that shown in the standard table, for small samples.


3. SUFFICIENT CONDITIONS FOR APPROACH TO NORMALITY OF a(c) WITH INCREASING n

The remainder of the paper deals with the field of symmetrical tests of normality, homogeneous
of degree zero, represented by (3·1). It is essential to establish the conditions of
approach to normality of the frequency distribution of a(c) as the sample number increases.

$$a(c) = \frac{\frac{1}{n}\sum_i |x_i - \bar{x}|^c}{\left\{\frac{1}{n}\sum_i (x_i - \bar{x})^2\right\}^{c/2}}, \tag{3·1}$$

where $\bar{x} = \sum x_i/n$ and c is non-negative. It will be shown in succession that, subject to stated
conditions, with increasing n,

(i) the frequency distribution of

$$a_1(c) = \frac{\frac{1}{n}\sum_i |x_i|^c}{\left\{\frac{1}{n}\sum_i x_i^2\right\}^{c/2}} \tag{3·2}$$

tends towards normality, and

(ii) the frequency distribution of a(c) tends towards that of $a_1(c)$ and hence towards
normality.
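The two statistics can be sketched in code, assuming (as the later references to their numerators and denominators indicate) that a(c) is the mean of the cth absolute deviations from the sample mean divided by the (c/2)th power of the mean squared deviation, and that a_1(c) is the same ratio taken about the known universal mean of zero. The function names `a` and `a1` and the sample are illustrative.

```python
def a(sample, c):
    """a(c): deviations are measured from the sample mean."""
    n = len(sample)
    xbar = sum(sample) / n
    numerator = sum(abs(x - xbar) ** c for x in sample) / n
    m2 = sum((x - xbar) ** 2 for x in sample) / n
    return numerator / m2 ** (c / 2)

def a1(sample, c):
    """a_1(c): deviations are measured from the universal mean, taken as zero."""
    n = len(sample)
    numerator = sum(abs(x) ** c for x in sample) / n
    m2 = sum(x * x for x in sample) / n
    return numerator / m2 ** (c / 2)

data = [1.0, 2.0, 4.0, 7.0, 11.0]   # illustrative values only
print(a(data, 1), a(data, 4))
```

Both functions are homogeneous of degree zero: multiplying every observation by a constant leaves them unchanged, and a(c) is also unaffected by a shift of origin. For c = 2 the statistic is identically 1, and for c = 4 it is the ratio of the fourth moment to the squared second moment.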

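It is assumed, without loss of generality, that the universal mean of the universe from
which the sample of n is drawn is zero. Denote the kth absolute moment from zero by $\mu_{|k|}$,
k not being necessarily an integer. Given a positive quantity ε arbitrarily small, ω(ε) can be
found so that

$$\text{Prob}\left\{\left|\frac{1}{n}\sum |x_i|^c - \mu_{|c|}\right| < \omega\sqrt{\frac{\mu_{|2c|} - \mu_{|c|}^2}{n}}\right\} > 1 - \epsilon, \tag{3·3}$$

$$\text{Prob}\left\{\left|\frac{1}{n}\sum x_i^2 - \mu_2\right| < \omega\sqrt{\frac{\mu_4 - \mu_2^2}{n}}\right\} > 1 - \epsilon, \tag{3·4}$$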


provided, of course, that $\mu_{|2c|}$ and $\mu_4$ exist. As n increases ω may be envisaged as approaching
the normal probability point appropriate to the probability ε, since, in the conditions stated,
$\sum |x_i|^c/n$ and $\sum x_i^2/n$ are normally distributed in the limit. For samples which satisfy the
inequality in the brackets { } at (3·4) and if n is so large that

the denominator of (3-2) can be expanded to three terms (including the remainder) by 
Taylor’s theorem, so that a^(c) may be written 

•l(«) =■ £!/.)x) , (3-6) 

with 2/i == |‘’“-“lcl)/-“|o|3 

i 0 ^-J(c+4) 


( 0 < 6 »< 1 ). 
With probability exceeding (1 − ε) it is evident, from (3·4), that X is maximized by

\ /KaV n] 

It will suffice, for the present purpose, to infer that

|X| < κ,

where κ is a constant independent of n. We have now


Set 




1 Cy«|c+2| /c ,\’*) 

»Wfol Wi \2 J }’ 

(3-6) 

and 


(3-7) 

with 


(3-8) 


For samples which satisfy the inequalities in { } at (3·3) and (3·4) and hence with a probability
exceeding (1 − 2ε), we have

+ (3-9) 

where*^ is independent of n. Or, briefly, 

Probj|Mj<J^j>L-2e, {3-10) 

so that u tends in probability towards zero with 1/n. Now (3·7) may be written in the form
u = Y' − Y, where Y' and Y are the respective terms on the left side. If A be any number
and F the total probability function, a well-known lemma (Frechet, 1937, p. 164) shows that

1 TAA)-Fr(A) 1 < [Fy{A + j^-Fy -|j)) + (3-11) 

using (3-10). Hence the frequency distribution of 

y, _1 l y^o>i{c) 

<^1 /«lc| 

tends towards that of Y = — E (3‘13) 

at every continuity point of the latter frequency, as n tends towards infinity. But Y, from
(3·13), is the simple average of n random measures, and its frequency must tend towards
normality provided that its standard deviation exists; from (3·6) it is evident that σ is
finite provided that $\mu_{|k|}$, where k is the greater of 2c and 4, is finite. Here and in the remainder
of this section it will be useful to remember that if $\mu_{|k|}$ exists so does $\mu_{|k'|}$ for 0 ≤ k' < k.


( 3 - 12 ) 
To prove that the frequency distribution of a(c) tends towards that of $a_1(c)$ and hence
towards normality with increasing n it will be shown that $\sum |x_i - \bar{x}|^c/n$ tends in probability
towards $\sum |x_i|^c/n$. Two cases will be considered separately: (1) c ≥ 1, (2) 1 > c > 0.


Case (1), c ≥ 1

For values of for which | a;^ ] > | »[, 

\Xi-x\<^-\xi\<^ = ±cx\xi-e^<^-'^ (O<0<1) 

and when |a:^-1 <1^1- I®-1 1“ 1 g(2«+1) |»|=. 


Hence 






< X 


IB " , 

(~ s 

\n i«i' 




)' 


(3-14) 


B and C being independent of the $x_i$ and n but depending on c. With ε arbitrarily small,
ω can be found so that


Prob{la;l<6; /——e, 
n\ 


ProbI 




<0) 


y ^l2o-ai 


> 1 — 


(3-15) 


Hence, from (3-14) and (3-15), ifand /t| 2 (,_ 2 | exist, 


Prob 


^S\xi-x\'^-~Z\xi\<= 


<B 


^jn 


>l — 2e 


for n sufficiently large, the constant B' depending on c but not on n. Hence for c ≥ 1,
$\sum |x_i - \bar{x}|^c/n$ tends in probability towards $\sum |x_i|^c/n$. Incidentally, this proves that
$[\sum (x_i - \bar{x})^2/n]^{c/2}$ tends in probability towards $[\sum x_i^2/n]^{c/2}$, the latter two expressions
representing respectively the denominators of a(c) and $a_1(c)$.

Case (2), 1 > c > 0

Let X satisfy a probabilistic inequality identical in form with the first equation of (3*15) 
and let y be any positive quantity, fixed once for all. Let n (presently to be defined further) 
be so large that 


y><x) 


Then 


1 ^ 

- 2 
^ i-l 


®)=^( S' + 2" 


(3-16) 


When \xi\'^ 2'), 


so that 


Prob 


(n 


1 1 ^ = -^cx\x^ — dx 1 *^“^ (0 < 0 < 1 ), 


x^ — x\ 


I <C0} 


(’'-“VC* 


>1 — 6 . 


(3-17) 


When \ x^\<y (i.e. in S"), given t] arbitrarily small and positive, % can be found so that 


when 


\Xi-x\<=~\x.i 

I x\ <0) 


<'7. 


(3-18) 


IH 

V n ’ 


since $|x|^c$ (c > 0) is uniformly continuous in Σ''. We then have

Prob{l j a:^—« |‘=-I a;^ 1®J <^}> 1 —e. 


(3-19) 
Combining (3·17) and (3·19), it may be inferred that


Prob 


n' n ' ' 


<co)^y~0}J^'^ J^ + 7]^>l-2e, ( 3 ' 


20 ) 


the first term of the upper limit in { } tending to zero as n tends towards infinity, and ε and η
being arbitrarily small.

We have accordingly shown that the numerator and denominator of a(c) tend in probability
towards those of $a_1(c)$. Hence a(c) tends in probability towards $a_1(c)$. Hence, using
the lemma cited at (3·11), the total frequency of a(c) tends towards that of $a_1(c)$ which tends
towards normality as n tends towards infinity. Finally:

If c ≥ 0 the frequency distribution of a(c), given by (3·1), tends towards normality as n tends
towards infinity provided that $\mu_{|k|}$, where k is the greater of 2c and 4, is finite.

It seems likely that an analogous theorem can be proved for 0 > c> — ^; we shall not, how¬ 
ever, be concerned in this communication with negative values of c. 


4. Moments of a(c) for normal samples

While it will be shown in later sections that, with indefinitely large samples, $\sqrt{b_1}$ and $b_2$ are
the most efficient tests of asymmetry and kurtosis, respectively, it by no means follows that
other tests are inefficient or that they may not be useful supplements in cases in which the
prime tests are indecisive as to the probable non-normality of a given sample. It is accordingly
proposed to give here close approximations to the first four moments (from the origin)
of a(c) (given by (3·1)) for normal random samples of n.

For normal samples (R. A. Fisher, 1929; R. C. Geary, 1933)

$$M_k'\{a(c)\} = E[a(c)]^k = E\left[\frac{1}{n}\sum_i |x_i - \bar{x}|^c\right]^k \Big/ E\left[\frac{1}{n}\sum_i (x_i - \bar{x})^2\right]^{ck/2}. \tag{4·1}$$

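The exact value of the denominator is, of course, known, for

$$E s^{k'} = \left(\frac{2}{n-1}\right)^{k'/2} \left(\frac{n + k' - 3}{2}\right)! \Big/ \left(\frac{n-3}{2}\right)! \tag{4·2}$$

since, as usual, $(n-1)s^2 = \sum (x_i - \bar{x})^2$. It will be useful to expand $\log_e E s^{k'}$ with k' = ck using
Stirling's formula in (4·2):

$$\begin{aligned}
\log_e E s^{k'} ={}& \frac{k'(k'-2)}{4(n-1)} - \frac{k'(k'-1)(k'-2)}{12(n-1)^2} + \frac{k'^2(k'-2)^2}{24(n-1)^3} - \frac{k'(k'-1)(k'-2)(3k'^2 - 6k' - 4)}{120(n-1)^4} \\
&+ \frac{k'^2(k'-2)^2(k'^2 - 2k' - 2)}{60(n-1)^5} - \frac{k'(k'-1)(k'-2)(3k'^4 - 12k'^3 + 24k' + 16)}{252(n-1)^6} \\
&+ \frac{k'^2(k'-2)^2(3k'^4 - 12k'^3 - 4k'^2 + 32k' + 32)}{336(n-1)^7} + \ldots,
\end{aligned} \tag{4·3}$$

which checks for k' = 1 to $(n-1)^{-7}$ with Geary (1935, p. 354). Take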

(4-4) 


with 


$z_i = x_i - \bar{x}$.
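The Stirling expansion of log E s^{k'} can be compared numerically with the exact value, assuming unit universal variance so that (n − 1)s² is distributed as χ² with n − 1 degrees of freedom. The sketch below is only a check on the series; the function names are illustrative, and the exact value is obtained from the log-gamma function rather than from the factorial notation of the text.

```python
import math

def log_Es_exact(kp, n):
    """log E s^kp for a normal sample of n with unit variance."""
    m = n - 1
    return ((kp / 2) * math.log(2 / m)
            + math.lgamma((m + kp) / 2) - math.lgamma(m / 2))

def log_Es_series(kp, n):
    """The same quantity from the expansion in powers of 1/(n - 1), to (n - 1)^-7."""
    m = n - 1
    k = kp
    p = k * (k - 1) * (k - 2)      # common factor of the even powers of 1/m
    q = k ** 2 * (k - 2) ** 2      # common factor of the odd powers from m^-3 on
    return (k * (k - 2) / (4 * m)
            - p / (12 * m ** 2)
            + q / (24 * m ** 3)
            - p * (3 * k ** 2 - 6 * k - 4) / (120 * m ** 4)
            + q * (k ** 2 - 2 * k - 2) / (60 * m ** 5)
            - p * (3 * k ** 4 - 12 * k ** 3 + 24 * k + 16) / (252 * m ** 6)
            + q * (3 * k ** 4 - 12 * k ** 3 - 4 * k ** 2 + 32 * k + 32) / (336 * m ** 7))

print(log_Es_exact(2.4, 25), log_Es_series(2.4, 25))
```

Every term of the series carries the factor k'(k' − 2), so that the expansion vanishes identically for k' = 2, as it must since E s² = 1.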
The moments of a(c) will be found exactly as in the case of c = 1 (Geary, 1936) from the
single or joint normal frequency distributions of ($z_1$, $z_2$, ...). We find

1 lin —/c —1\ 

For the third moment we write 

M'Mo)) = ^ 1 1^+—I1"“ I ^2 Is, 1 


(4-5) 


$= A_1 + A_2 + A_3$,

denoting the three terms on the right by $A_1$, $A_2$, $A_3$ respectively. Then




c ^“4(4-h3G)^ 


■ 


3.2*‘=/2c-1\ ./c-1 


77 


'{ 


l(?^ - {n ~ 1)“^®? 

2c "H1) (c 
4!(w-l)^ 


11 I (2c+1) (c-f ^ (2c+ 3) (2c +1) ( 0 + 3 ) (c+1) . 




2c\l 


(n ~ (n — (?i 1) n~^ 




c-l) 

2 


^■^2(w-2)2 


(c + l)s (c + l)Mc + 3)(7c+9) (c+3)=‘(c+l)3 
(?i-2)® 8(?i-2)* 2(ri.-2)® 

(c + 3)^(c+ 1 )2 (61cH310c+265) 


240(?i-2)6 

Similarly, for the fourth moment, 




Km - -EW'))* = I *. I*"+ EI *. |» II- 


+ 


3n(n-l) 


71* 




67i(«- 1) (?i —2) 




+ 


n{n — l){n—2) (w-3) 


^ 1-^11“ ha 1“ hsl '\»i 


with 


= 0 i + 0^-\-02 + G^-\^0^ 
22c 




(¥)i. 


22(c+l) 

Cj = --(ft - 2)^(*=+« (w -1 )-2“ W-* 




,, , (3c + l)(c + l) , (3c + 3)(3c+ l)(c+3)(c4-l) 
2 !(m- 1)2 4!(w-l)^ 


+ .. 
(4-6) 

^al^hal” 

(4-7) 


(4.8) 


+ ... , 
3.22« ; , (2c + l)® {2 c + 3)2(2c + 1)2 | 

Oz =—(n-2)K*o+«(n-jlj {l+i!(^ZT?+ J!(n-1)^ +‘-t 


3.22C+1 
/r* 


(?i-(»-2)-«*«+« {n -! (^)]‘ 

I, (c+l)(6c+3) (c + l)®(2c+l) (c + 1 )(57c^ + 227 c^ + 255c+81 ) 
2(«-2)2 (w-2)» 24(71-*-2)4 


(2c + l)(c-i-l)Mc + 3)(5c + 9) 


6(n-2)'^ 


+ ... 


2^ 

^5 ~ ^ ~ 4)1!4“+®> (m — 3)“®“"^ [n - 


■ 2 ) ! 


+ 


3(c-1-1)2 4(c+l)a (c-hl)2(7c2-l-21c--f 16) 4(c-h 3) (c-F-1)3 (2c +3) 
(71-3)2 (71-3)3 (71-3)4 (71-3)« 


-H 


(c -4 3) (c -h 1)2 (122c3 -f 671c2 + 1070c -)- 526) 
16(71-3)3 


Formulae (4·5), (4·6), (4·7) and (4·8) were checked from the corresponding formulae for
c = 1 given in the author's 1936 paper.

From the following section it will be apparent that for indefinitely large samples the most
sensitive test of kurtosis of the field a(c) is found for c = 4. At the same time it is shown that
there is really not much difference in efficiency for values of c in the range 5 > c > 2; moreover,
the results in § 6 (in which the efficiencies of the tests for c = 4 and c = 1 are compared from
the power function viewpoint) suggest that, for samples of moderate size, the superiority,
if any at all, of a test using a(4) = $b_2$ over other tests in the series may be even less marked.
The disadvantage of a(4) is that its frequency is not known for samples of all sizes; and if we
could estimate, with any degree of confidence, the probability points of a(c) for any value or
values of c > 2 for medium-size samples we might, for practical purposes, dispense with a(4)
altogether, since, while we now know one way of solving the problem of determining the
exact, or almost exact, frequency distribution of a(4), it must be admitted that the method
is extremely tedious. (From the theoretical point of view, however, the a(4) problem must
be solved since it remains a challenge to the mathematical skill of statisticians!) It will
accordingly be of interest to study the order of magnitude of the semi-invariants of a(c)
for c near 2.

Consider the case, for example, of c = 2·4, not by any means, it is important to
observe, the lowest value which would be used for tabulating. In Table 4 the first three
moments are given for n = 25. The L's represent, of course, the semi-invariants. The values
of the functions for $a_1(c)$ (given by (3·2)) for n = 24 (i.e. the appropriate number of degrees
of freedom for comparison with a(c)) are also given. These show that the moments of $a_1(c)$
are very close to those of a(c), which suggests that, when n is not less than, say, 20, the values
of $B_1$, $B_2$ and corresponding functions of higher orders, if required, for $a_1(c)$ could be used
for the determination of the probability points of a(c). This is important from the computational
point of view because the algebraic expressions for the normal moments of $a_1(c)$
are exceedingly simple whereas it must be conceded that (4·8) offers a grim prospect for the
computer; furthermore, the principal term is rather slowly convergent unless n > 60 or so,
whereas exact values for all values of n can readily be found for the moments of $a_1(c)$ for normal
samples.


Table 4. Normal moments, etc., of a(c) and $a_1(c)$ for c = 2·4

                              a(2·4)       $a_1$(2·4)
n                             25           24
$M_1' = L_1$                  1·166252     1·1662524891
$M_2'$                        1·362004     1·362091186
$M_3'$                        1·592841     1·593151616
$L_2$                         0·001860     0·001946318
$L_3$                         0·000063     0·000069583
$\sqrt{B_1} = L_3/L_2^{3/2}$  0·80         0·8104


As with (4·1) for a(c), the moment (from the origin) of any order of $a_1(c)$ is the quotient
of the moments of the same order for numerator and denominator, assuming that the
universal mean is zero and the variance unity. Since the different members of the sample
are independent (the difficulty with a(c) is that the $(x_i - \bar{x})$ are not independent), for the
moments of the numerator of (3·2) we require only

$$E|x|^{k'} = \frac{2^{(k'+1)/2}}{\sqrt{2\pi}} \left(\frac{k'-1}{2}\right)! \tag{4·9}$$

and for the denominator

$$E\left(\frac{1}{n}\sum x_i^2\right)^{k'/2} = \left(\frac{2}{n}\right)^{k'/2} \left(\frac{n + k' - 2}{2}\right)! \Big/ \left(\frac{n-2}{2}\right)! \tag{4·10}$$
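A minimal sketch of how the first two normal moments of a_1(c) follow from these two ingredients is given below, assuming unit variance and zero mean. The factorials of the text are written with the gamma function (z! = Γ(z + 1)), and the function names are illustrative. The second moment of the numerator uses only the independence of the observations: E(Σ|x_i|^c)² = nμ_{|2c|} + n(n − 1)μ²_{|c|}.

```python
import math

def abs_moment(s):
    """E|x|^s for a standard normal variate."""
    return 2 ** (s / 2) * math.gamma((s + 1) / 2) / math.sqrt(math.pi)

def denom_moment(s, n):
    """E[(sum of x_i^2 / n)^(s/2)] for n independent standard normal x_i."""
    return (2 / n) ** (s / 2) * math.exp(
        math.lgamma((n + s) / 2) - math.lgamma(n / 2))

def a1_first_two_moments(c, n):
    """First and second moments (from the origin) of a_1(c) for normal samples."""
    mu_c, mu_2c = abs_moment(c), abs_moment(2 * c)
    m1 = mu_c / denom_moment(c, n)
    numerator_second = (mu_2c + (n - 1) * mu_c ** 2) / n
    m2 = numerator_second / denom_moment(2 * c, n)
    return m1, m2

m1, m2 = a1_first_two_moments(2.4, 24)
l2 = m2 - m1 ** 2          # second semi-invariant
print(m1, m2, l2)
```

For c = 2·4 and n = 24 this gives a mean close to 1·16625 and a variance close to 0·001946; for c = 4 the mean reduces to 3n/(n + 2).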


The case of c = 4 is particularly simple. The first four semi-invariants are as follows:

$$\begin{aligned}
L_1 &= M_1' = \frac{3n}{n+2}, \\
L_2 &= M_2 = \frac{24n^2(n-1)}{(n+2)^2(n+4)(n+6)}, \\
L_3 &= M_3 = \frac{1728\,n^3(n-1)(n-2)}{(n+2)^3(n+4)(n+6)(n+8)(n+10)}, \\
L_4 &= \frac{10{,}368\,n^4(n-1)(30n^4 + 168n^3 - 608n^2 - 2672n + 3712)}{(n+2)^4(n+4)^2(n+6)^2(n+8)(n+10)(n+12)(n+14)}.
\end{aligned} \tag{4·11}$$
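The closed forms for c = 4 are easily evaluated in exact rational arithmetic. The sketch below assumes the powers of n and (n + 2) shown above (squares in L_2, cubes in L_3, fourth powers in L_4); the function name is illustrative. A simple independent check is available for n = 2, where a_1(4) = 3/2 + ½ cos 4θ with θ uniformly distributed, so that L_2 = 1/8, L_3 = 0 and L_4 = −3/128.

```python
from fractions import Fraction

def semi_invariants_a1_4(n):
    """First four semi-invariants of a_1(4) for normal samples of n."""
    n = Fraction(n)
    l1 = 3 * n / (n + 2)
    l2 = 24 * n ** 2 * (n - 1) / ((n + 2) ** 2 * (n + 4) * (n + 6))
    l3 = (1728 * n ** 3 * (n - 1) * (n - 2)
          / ((n + 2) ** 3 * (n + 4) * (n + 6) * (n + 8) * (n + 10)))
    poly = 30 * n ** 4 + 168 * n ** 3 - 608 * n ** 2 - 2672 * n + 3712
    l4 = (10368 * n ** 4 * (n - 1) * poly
          / ((n + 2) ** 4 * (n + 4) ** 2 * (n + 6) ** 2
             * (n + 8) * (n + 10) * (n + 12) * (n + 14)))
    return l1, l2, l3, l4

for size in (24, 50):
    print(size, [round(float(v), 6) for v in semi_invariants_a1_4(size)])
```

The ratios L_3/L_2^{3/2} and L_4/L_2² computed from these values give the skewness and excess kurtosis of a_1(4) used in the comparison with c = 2·4.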


Moments, etc., for $a_1(c)$ for normal samples of 24 and 50 are contrasted for c = 2·4 and
c = 4 in Table 5. The contrast between the values of $\sqrt{B_1}$ and $(B_2 - 3)$ respectively for
$a_1$(2·4) and $a_1$(4) is striking in the extreme. Even for n = 24, $\sqrt{B_1}[a_1(2·4)]$ and $(B_2 - 3)[a_1(2·4)]$
are approaching the values at which a Gram-Charlier approximation to the frequency
distribution may be reasonably convergent. Furthermore, the decline in the values of the
B's from n = 24 to n = 50 is marked for $a_1$(2·4), while the decline in the $B[a_1(4)]$ is very slow.

It is accordingly suggested that a table of probability points (perhaps 0·001, 0·01, 0·025,
0·05 and 0·10) of a(c), for c equal to, say, 2·2, be prepared for n ≥ 26 on the assumption that
Gram-Charlier applies throughout. For this purpose the values of the mean and variances
for n at intervals of, say, 10 should be computed from formulae (4·5) and (4·6); the $B_1$ and
$(B_2 - 3)$ should, however, be computed as for $a_1(c)$. For lower sample sizes it might be well
to use terms to order $n^{-2}$ which would render necessary the use of the fifth and sixth semi-invariants
of $a_1(c)$. The formulae given by E. A. Cornish & R. A. Fisher (1937) (assuming
Gram-Charlier) could be used to find the probability points. On account of the minuteness
of the variance for c near 2 it will be necessary to work to many places of decimals, at
least 10. As stated at the outset, the test of kurtosis a(2·2) will be only slightly less efficient
than a(4) and it may be slightly more efficient than a(1), the probability points of which are
known approximately for samples of all sizes. In any case the a(2·2) table would be a useful
adjunct to that of a(1).
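The kind of computation proposed can be sketched with the leading terms of the Cornish-Fisher expansion, which turn a mean, a variance, a skewness and an excess kurtosis into approximate probability points. This is the standard four-moment form and is given here only as an illustration; it is not claimed to be the exact set of terms the author had in mind, and the function name is hypothetical. The numerical inputs are the values for a_1(2·4) with n = 24 quoted in the tables.

```python
from statistics import NormalDist

def cornish_fisher_point(p, mean, variance, skew, excess):
    """Approximate p-quantile from the first four semi-invariants.

    skew = L3 / L2**1.5 and excess = L4 / L2**2; terms involving the
    fifth and higher semi-invariants are omitted.
    """
    z = NormalDist().inv_cdf(p)
    w = (z
         + skew * (z ** 2 - 1) / 6
         + excess * (z ** 3 - 3 * z) / 24
         - skew ** 2 * (2 * z ** 3 - 5 * z) / 36)
    return mean + w * variance ** 0.5

levels = (0.001, 0.01, 0.025, 0.05, 0.10)
points = [cornish_fisher_point(p, 1.1662524891, 0.001946318, 0.8104, 1.30)
          for p in levels]
for p, x in zip(levels, points):
    print(p, round(x, 4))
```

With zero skewness and zero excess the function returns the normal probability point, which gives a convenient check on the implementation.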


Table 5. Normal moments, etc., of $a_1(c)$ for c = 2·4 and c = 4

                    n = 24                        n = 50
              c = 2·4         c = 4         c = 2·4         c = 4
$L_1$         1·1662524891    2·769231      1·1721603127    2·884615
$L_2$         0·001946318     0·559932      0·001058462     0·359550
$L_3$         0·000069583     0·752488      0·000022261     0·343337
$L_4$         0·000004921     1·956999      0·000000919     0·711375
$\sqrt{B_1}$  0·8104          1·7960        0·6462          1·5925
$B_2 - 3$     1·30            6·24          0·82            5·50

In an earlier paper (1936) the writer suggested that the correlation between $b_2$ and a(1)
for normal samples gave some indication of the relative efficiency of these two tests of
normality. In this order of ideas it seems desirable to compute the approximate value of
the correlation coefficient between a(c) and a(c'), where c and c' are any two positive constants.
In the first instance the universe from which the sample of n was drawn was not
necessarily normal. Since in the present application we will be concerned only with large
samples we assume the universal mean known (and accordingly it may be taken as zero,
i.e. $\lambda_1 = 0$), so that, instead of a(c) we use, in reality, $a_1(c)$ given by (3·2). In the remainder
of this section we write a for $a_1(c)$ and a' for $a_1(c')$:


Set 


Then 


^ _C(C + l){C + 2)...{C + k-l) 

^ ---n-. 


k\ 


aa 

aoc 




-Kc+c') 


(4-12) 
(4-13) 


(4-14) 


(4-15) 
The value of E(aa') was found approximately (i.e. to terms in $n^{-3}$) by formally
expanding the last factor in (4·15), multiplying by the first two factors, and setting down
the mean value term by term, so that

[l+-^C2,7iEz^ — ^C^nEz^ 

y fo Ti 

+ ^(^4(w-Ba* + Ez^Ez^) 

+ -6C'e 90-g-+1 - -2 {nEyz + nEy'z )+^ {nEyz^+nEy'z^) 

Q _ 

-^ {n[Eyz^ + Ey'z^) -{-Znn-l Ez^Eijz + Ey'z)] 

7h 

Q _ _ 

+^l4:nn-i Ez^Eyz + Ey'z) + Qnn-\ Ez\Eyz^ + Ey'z^)] 

30(75 71.?^ — Iw —2 ft 

— --- Eh\Eyz + i?2/ a) j + nEyy' - ^ nEyy'z 

Q _ _ 

+ ^ [nEyy'z^ + 2 n n -1 Eyz Ey'z + nn-1 Eyy' Ez^] 

Q _ _ 

— l[nn-l Eyy' JSz®+3«w -1 {Eyz^Ey'z + Ey'z^Eyz + Eyy'zEz^)^ 

7h 

+ ^ Eyy'Eh^ + 1 2n ^ EyzEy’zEz'^'Ji. (4-16) 

The E's in (4·16) are readily calculable from (4·14), e.g.

$$E\,yy' = E\,y_i y_i' = \frac{E\left(|x|^c - \mu_{|c|}\right)\left(|x|^{c'} - \mu_{|c'|}\right)}{\mu_{|c|}\,\mu_{|c'|}} = \frac{\mu_{|c+c'|}}{\mu_{|c|}\,\mu_{|c'|}} - 1.$$

It has been verified that when c is substituted for c' in (4·13) the formula agrees with that
for the second moment of $a_1(c)$ given in § 6.

The coefficient of correlation is, of course, 

with = 

Formulae for the first and second moments, to the approximation required, for the computation
of (4·17) are given in § 6.

As an application, the following are the values of the variances and the covariance for the
tests of normality a(1) and a(4) (= $b_2$), i.e. in which c and c' have respectively the values 1 and 4,
and where the universe belongs to the Pearson system with $\lambda_2 = 1$, $\lambda_3 = 0$ and $\lambda_4$ =

71 

0*491 




71^ 




4-87 281*5 

71^ 71^ 


rr 


JI 4 ^ 0*09313705 0*262961 0*196477 

'Kr n 

Jif,,,. _ 4-4286 92-26 
//2 ~ 

Jhf,v 


831-2 


(4-18) 


/^\o\hc'\ 


n 
From (4·17) and (4·18), $R_{cc'}(n = 100) = -0·826$ and $R_{cc'}(n = \infty) = -0·764$. It is of great
interest to find that, though the universe is markedly non-normal, the correlation for indefinitely
large samples is practically identical with the normal theory value of −0·767
(Geary, 1936), another indication, no doubt, that normal theory inferences can usually be
applied with confidence when the parent universe is not markedly unsymmetrical.

When samples are indefinitely large we find, from (4*16) and (4*17), 

D ^/<lc+c'l ” + c'/«lo'|/“lc+2|) + (CC'/i 4 — C — 2. C' — 2) y\c\y\c'\ 

mx) ' ' 


where, of course, the values to be taken here for and found by substituting 

respectively c' for c and c for c' in the numerator. When, in addition, the parent universe is 
normal, we find 



which reduces to $-1/\sqrt{12(\pi - 3)}$ for c = 1, c' = 4, as it should (Geary, 1935). The following
section will accord $b_2$ (i.e. a(4)) a decided primacy amongst tests of normality when the
samples are indefinitely large. It may, therefore, be of interest to give the values of the
correlation coefficients (for indefinitely large normal samples) between $b_2$ and a(c) for
selected values of c (Table 6). The table suggests, in the high coefficients of correlation,
except for c very near 0 or 2, that all the a(c) should be reliable tests of kurtosis, with no great
difference between their efficiencies. The efficiency of any two tests would be identical, in
the conditions stated, if the coefficient of correlation between them was ±1 because then,
of course, they would be functionally, and not stochastically, related.
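The limiting normal-theory correlation can be sketched numerically. The formula used below is an assumption reconstructed by the usual large-sample (delta-method) argument, not copied from the text, whose own expression is not legible in this copy: with unit variance, the covariance of a_1(c) and a_1(c') is proportional to μ_{|c+c'|} − μ_{|c|}μ_{|c'|}(1 + cc'/2). It does reproduce the stated reduction to −1/√{12(π − 3)} for c = 1, c' = 4. The function names are illustrative.

```python
import math

def mu(s):
    """E|x|^s for a standard normal variate."""
    return 2 ** (s / 2) * math.gamma((s + 1) / 2) / math.sqrt(math.pi)

def cov_term(c, cp):
    """n times the limiting covariance of a_1(c) and a_1(cp), normal parent."""
    return mu(c + cp) - mu(c) * mu(cp) * (1 + c * cp / 2)

def limiting_correlation(c, cp):
    """Correlation of a_1(c) and a_1(cp) for indefinitely large normal samples."""
    return cov_term(c, cp) / math.sqrt(cov_term(c, c) * cov_term(cp, cp))

for c in (1, 3, 5, 6):
    print(c, round(limiting_correlation(c, 4), 3))
```

The value c = 2 must be avoided, since a(2) is identically 1 and its variance vanishes.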


Table 6. Correlation between $b_2$ and a(c) for indefinitely large normal samples

Value of c    Value of R      Value of c    Value of R
0             0               3             0·980
1             −0·767          4             1
2             0               5             0·983
2·2           0·887           6             0·939
2·5           0·952           ∞             0


5. The most efficient tests for indefinitely large samples

In this section we consider the efficiency of tests of kurtosis and asymmetry from the viewpoint
of indefinitely large samples.

By definition a test will be regarded as valid, in relation to a field of continuous alternative
universes including the normal, if its value for infinite samples drawn at random from the
normal universe is different from its value for infinite samples from other universes of the
field. As the sample number increases the test will become increasingly discriminatory of
the normal as distinct from other universes of the field. This increased sensitivity might be
given mathematical expression in some such terms as the following: given a probability α
(say 0·01), the normal universe $W_0$ of the field and any other distribution $W_1$ of the field,
a number $n_0$ can be found so that for $n > n_0$ the mean value of the test function for samples
of n from $W_1$ will be at or beyond the α probability point of the test function for samples of n
from $W_0$: the smaller $n_0$, the more sensitive the test.

We consider, then, the infinite field of alternative tests of kurtosis represented by (3·1)
when c assumes all positive values, and the infinite field of alternative universes represented
by the Gram-Charlier frequency


a \ dxj 


The universal variance is assumed to be unity, without loss of generality. The normal
universe is a member of the field: it is found when all the $\lambda_i$ (i > 2) are zero. We assume that
the conditions of § 3 are satisfied so that for indefinitely large samples the frequency distribution
of a(c) for all parent universes is normal. Obviously the efficiency of any particular
test (i.e. a(c) for a particular value of c) in regard to the normal and a particular non-normal
alternative (i.e. a Gram-Charlier frequency with particular values of the $\lambda_i$) will be adjudged
by considering the ratio of

(i) the difference between the universal mean values of a(c) for the normal and the
particular non-normal parent universes; to

(ii) the standard deviation of a(c) for indefinitely large normal samples.

The most efficient test will be a(c) for c a theoretically ascertainable function of the given
$\lambda_i$ which makes the ratio a maximum.

For indefinitely large samples the mean value 0 of a{o) when the parent universe is given 


Obviously 
Also, when 1, 


V(27r)J. 
^ 00 
J — 00 


dx I X I®exp Is 


dx\x\<^ 


\ dx] 




e-i®’ = 0. 


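$$\int_{-\infty}^{\infty} dx\,|x|^c \left(\frac{d}{dx}\right)^{2m} e^{-\frac{1}{2}x^2} = \left(\frac{c-1}{2}\right)!\;2^{(c+1)/2}\,c(c-2)(c-4)\ldots(c-2m+2), \tag{5·3}$$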

a result readily inferable from the obvious fact that the left side vanishes for c = 0,2, 

2m— 2. Accordingly 

The normal value is given by the first term. 

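From (4·3), (4·5) and (4·6) it is evident that the value of the standard deviation, for large
normal samples (retaining only $n^{-1}$) is

$$\sigma = \frac{2^{c/2}}{\sqrt{\pi n}} \left\{\sqrt{\pi}\left(\frac{2c-1}{2}\right)! - \left[\left(\frac{c-1}{2}\right)!\right]^2 \frac{c^2+2}{2}\right\}^{1/2}. \tag{5·5}$$

The principal term in the deviation $\delta = \theta - \theta_0$ (where $\theta_0$ is the normal value), from (5·4), is

$$\delta = \frac{\left(\frac{c-1}{2}\right)!\;2^{c/2}}{\sqrt{\pi}} \cdot \frac{\lambda_4\,c(c-2)}{24}. \tag{5·6}$$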
To a constant factor, the ratio δ/σ is given by the first discriminant

$$\rho(c) = c(c-2)\left\{\frac{\sqrt{\pi}\left(\frac{2c-1}{2}\right)!}{\left[\left(\frac{c-1}{2}\right)!\right]^2} - \frac{c^2+2}{2}\right\}^{-1/2}. \tag{5·7}$$

It will now be shown that

$$\frac{d\rho(c)}{dc} = 0 \quad \text{for } c = 4.$$


The discriminant may be written in the form 

p(c) = c{c-2)^ 

X = 


where 


and 




p[c) c 0-2 2 


Qfl/ -4clQg 

I i 


/2“4 

c®+2\-i 

1 X 

“ 2 j ’ 

k 

cos" 

1 

edd, 

'■li. 

XJo\ , 


n) 


\ lo 


f ITT 

From(6'9) \ ci!(9 log" 0 log cos 6>. 

From a fairly well-known property 

riff 

Jq = d6 log cos 6^ — hr log 2. 
Jo 


(5-7) 


( 6 - 8 ) 

(5-9) 

{5-10) 

(5-11) 

( 6 - 12 ) 


In (5·10) we shall be concerned only with even positive integer values of c. We have at once


^ r -.1 T T T — 

2' a-J’ ''8~32’ ■^®~ 256' 


(5-13) 


From. (5-11) Xc= f dd coa®° 0 log cos 0 = (* ci!(sin 0) cos®“"^ 0 log cos 0, 

Jo Jo 

which, by partial integration, 

/* liT /_—1 fj qin ■ 

= d 0 sin 0 ( 2 C’-l Bin 0 cos^“*'^ 0 log cos 0 H- 5 -) 

Jo \ 0030 / 

= (2c — 1) (J 2 C -2 “ ^ 2 c) "h - 40-2 

Hence = (2c — 1) *4c "h Jac-s* 

From (5-12), (5-13) and (5*14), 

= -•|7rlog2, Jg := (-.607rlog2 + 377r)/384, " 

J 2 -: (-~27rlog2-h7r)/8, Jg (-8407rlog2 + 5337r)/6144. . 
^ 4 = (-l27rlog2 + 77r)/64, 


(6*14) 


(5*15) 


Noting that I^c ~ substituting in the right side of ( 5 * 10 ) the values of I and / given 

by (5·13) and (5·15), we find ρ'(4) = 0. Table 7 gives the values of the discriminant for certain
values of c.
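The discriminant is easily evaluated numerically, which gives an independent check on the maximum at c = 4. The sketch below assumes the form ρ(c) = c(c − 2){√π Γ(c + ½)/Γ((c + 1)/2)² − (c² + 2)/2}^{−½}, with the factorials of the text written as gamma functions; the function name and the grid of values are illustrative.

```python
import math

def discriminant(c):
    """First discriminant rho(c); undefined at c = 2, where the bracket vanishes."""
    ratio = math.sqrt(math.pi) * math.exp(
        math.lgamma(c + 0.5) - 2 * math.lgamma((c + 1) / 2))
    return c * (c - 2) / math.sqrt(ratio - (c * c + 2) / 2)

for c in (1.0, 3.0, 4.0, 5.0, 6.0, 7.0):
    print(c, round(discriminant(c), 3))

# Locate the largest value on a grid of tenths between 2.1 and 8.0.
grid = [i / 10 for i in range(21, 81)]
best = max(grid, key=discriminant)
print(best)
```

At c = 4 the bracket equals 105/9 − 9 = 8/3, so that ρ(4) = 8/√(8/3) = √24, which is the tabulated maximum of about 4·900.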

The discriminant accordingly assumes a maximum value for c = 4, a result so remarkable
that one might be inclined to suspect that it is a consequence of the form which was assumed
for the alternative to the normal curve, a form which, in placing such emphasis on $\lambda_4$,
high-lights, so to speak, $b_2$ (= $\lambda_4 + 3$ when $\lambda_2 = 1$ for indefinitely large samples) as a test of
normality. From the algebraic point of view this is anything but obvious: the property
emerges from quite a complicated piece of algebra. It may also be emphasized that the field
of alternatives (5·1) is not arbitrary; it is a general form of frequency distribution when all
the $\lambda_i$ are finite. Admittedly the discriminant takes account only of the term in $\lambda_4$ in the
expansion; but this is certainly the most significant term for a wide class of frequency distributions,
namely, those of homogeneous symmetrical functions of samples of n as n tends
towards infinity under very general conditions for the parent universe, provided that the
resulting frequency distribution can be assumed to have its third moment zero; for then the
only term in $n^{-1}$ in the frequency distribution of the function will be the term in $\lambda_4$. The
significance of the property demonstrated must not be overstressed since it is subject to
many qualifications, but it gives strong grounds for holding that, for very large samples,
$b_2$ is the most efficient test of normality of tests of type a(c) in relation to a very extended
class of alternative universes. At the same time Table 7 shows that there can be little
difference in efficiency in the field a(c) for c ranging from close to 2 to about 5. There is but
little doubt, on this showing, that $b_2$ is more sensitive than a(1), a conclusion suggested on
the basis of certain experimental results by E. S. Pearson (1935) and examined from the
viewpoint of power function theory in § 6.


Table 7

      0 < c < 2                     2 < c < ∞
c        Discriminant ρ(c)      c        Discriminant ρ(c)
+0       −2·334                 2+0      4·460
0·1      −2·541                 2·1      4·508
0·2      −2·726                 2·5      4·666
0·5      −3·188                 3·0      4·801
0·7      −3·441                 3·9      4·898
1·0      −3·758                 4·0      4·900
1·1      −3·851                 4·1      4·898
1·5      −4·166                 5·0      4·818
1·9      −4·406                 6·0      4·602
2−0      −4·460                 7·0      4·288
                                8·0      3·906


Adverting to (5·4) in conjunction with (5·6), it might be asked if, on the analogy of the
maximal property just demonstrated for the first discriminant, the function

$$\rho_2(c) = c(c-2)(c-4)\left\{\frac{\sqrt{\pi}\left(\frac{2c-1}{2}\right)!}{\left[\left(\frac{c-1}{2}\right)!\right]^2} - \frac{c^2+2}{2}\right\}^{-1/2}$$

has a turning point at c = 6. The answer is in the negative. The value of $\rho_2'(6)/\rho_2(6)$ is, in fact,
15/34. At the same time there must be a zero of $\rho_2'(c)$ very near c = 6 since
$\rho_2(5·9)$ = 8·79, $\rho_2(6)$ = 9·20, $\rho_2(6·1)$ = 8·66.
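The value 15/34 can be checked directly. The working below assumes that the second discriminant is (c − 4) times the first, i.e. that it carries the same bracket as ρ(c), and writes h(c) for the ratio of normal absolute moments μ_{|2c|}/μ²_{|c|}. The symbols h and ψ (the digamma function) are introduced here for illustration only.

```latex
\begin{aligned}
h(c) &= \frac{\sqrt{\pi}\,\Gamma\!\left(c+\tfrac12\right)}{\Gamma\!\left(\tfrac{c+1}{2}\right)^{2}},
\qquad
\rho(c) = c(c-2)\left\{h(c)-\tfrac{c^{2}+2}{2}\right\}^{-1/2},\\[4pt]
\frac{h'(c)}{h(c)} &= \psi\!\left(c+\tfrac12\right)-\psi\!\left(\tfrac{c+1}{2}\right),
\qquad
h(6)=\frac{\mu_{12}}{\mu_{6}^{2}}=\frac{10395}{225}=\frac{231}{5},\\[4pt]
\frac{h'(6)}{h(6)} &= \psi\!\left(\tfrac{13}{2}\right)-\psi\!\left(\tfrac{7}{2}\right)
=\frac{2}{7}+\frac{2}{9}+\frac{2}{11}=\frac{478}{693},
\qquad h'(6)=\frac{478}{15},\\[4pt]
\frac{\rho'(6)}{\rho(6)} &= \frac16+\frac14-\frac12\cdot\frac{h'(6)-6}{h(6)-19}
=\frac{5}{12}-\frac12\cdot\frac{388/15}{136/5}
=\frac{5}{12}-\frac{97}{204}=-\frac{1}{17},\\[4pt]
\frac{\rho_2'(6)}{\rho_2(6)} &= \frac{1}{6-4}+\frac{\rho'(6)}{\rho(6)}
=\frac12-\frac{1}{17}=\frac{15}{34}.
\end{aligned}
```

On the same assumption ρ_2(6) = 2ρ(6) = 48/√(136/5), or about 9·20, in agreement with the value quoted.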

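Analogous to the field of tests of kurtosis represented by (3·1) we may consider as a field
of tests of asymmetry:

$$g(c) = \frac{\frac{1}{n}\left\{-S'|x_i - \bar{x}|^c + S''(x_i - \bar{x})^c\right\}}{\left\{\frac{1}{n}\sum (x_i - \bar{x})^2\right\}^{c/2}}, \tag{5·16}$$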

Biometrika 34 

where S' extends to the observations $x_i$ less than the mean $\bar{x}$ and S'' to the rest of the sample. 
For c = 3 the test is, of course, $\sqrt{b_1}$. For normal samples 

E{g{c)f = + (5-17) 


the denominator of which is identical with the denominator of (4.1). Knowing the joint 
distribution (for normal samples) of $(x_1 - \bar{x})$, $(x_2 - \bar{x})$, ... (Geary, 1936), there is no theoretical 
difficulty in finding the mean values of the terms of the numerator for positive integer values 
of k. Here we shall be concerned only with the first and second moments, i.e. those for (5.17) 
for k = 1 and k = 2. We require the normal distribution of $z_1 = x_1 - \bar{x}$ and the joint distribution 
of $z_1$ and $z_2 = x_2 - \bar{x}$. These are 

$$\sqrt{\frac{n}{2\pi(n-1)}}\,\exp\left\{-\frac{n z_1^2}{2(n-1)}\right\} dz_1,$$

$$\frac{1}{2\pi}\left(\frac{n}{n-2}\right)^{1/2}\exp\left\{-\frac{(n-1)(z_1^2+z_2^2)+2z_1z_2}{2(n-2)}\right\} dz_1\,dz_2 = f(z_1, z_2)\,dz_1\,dz_2. \qquad (5.18)$$
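
The factors n - 1 and n - 2 in (5.18) come from the covariance structure of the deviations from the sample mean. As a minimal sketch, assuming independent observations of unit variance, the exact covariance matrix of $(z_1, z_2)$ can be computed with rational arithmetic; the function names are illustrative and not part of the paper.

```python
from fractions import Fraction


def residual_covariance(n):
    """Exact covariance matrix of (x_1 - xbar, x_2 - xbar) for n independent
    unit-variance observations, using z = C x with C = I - (1/n) * ones."""
    def c(i, j):
        return (Fraction(1) if i == j else Fraction(0)) - Fraction(1, n)

    # cov(z_a, z_b) = sum_k C[a][k] * C[b][k], since cov(x) is the identity
    return [[sum(c(a, k) * c(b, k) for k in range(n)) for b in range(2)]
            for a in range(2)]


def determinant(m):
    """Determinant of a 2 x 2 matrix."""
    return m[0][0] * m[1][1] - m[0][1] * m[1][0]


if __name__ == "__main__":
    cov = residual_covariance(10)
    print(cov[0][0], cov[0][1], determinant(cov))
```

Each deviation has variance (n - 1)/n, each pair has covariance -1/n, and the determinant is (n - 2)/n, which is why the one-dimensional density involves n - 1 and the two-dimensional density involves n - 2.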


Clearly the odd normal moments of g{c) are zero. Then 


E 


1 


n 


S'\x,-x\<=+^S"{x,-xr\ 


n{n^ 1) 


71^ 






where z^) is the mean value of the two-dimensional terms* We then have 
Fi(Zi, Za) == f (“ %)“dzj (~ 2:2)° <iz2f(z2, z^) - f dz^{ - Zi)® f dz2^2f{zi> Zg) 

J —00 J —00 J—OD Jo 

/*00 I'D f*oo ^00 

~ dzjZi dz2(-Z2)‘=/(zi,Za)-h dziz? dz^zi/lzi, Z 2 ) 

Jo J-OO Jo JO 


n o 

0 


z5z|dZidZ2{/(-Zi, -Z2 )-/(-Zi,22)-/(Zi, -Z2)+/(Zl.Za)} 


2 “+®/ 71 V‘(7l — 

2tt \n — 2 


(«-2)»+i/c \2f (c + 2)2 (c-h2)2(c-f4)2 1 

(w-l)«+ 2 \ 2 'j \ ■*'3!(n-l)2'^ 5!(»-l)4 '^•• 7 ’ 

/272.-1\« 1 

•I « 


^^ao = !£r 1-^277-1 
1 2 


Also 


El-E{x,-x)^ 




(6.19) 


(5-20) 

(5-21) 

(5-22) 


We now have all the expressions required for the variance of normal g(c). We require, for 
what follows, only the term in $n^{-1}$, which is 



Consider now a field of alternative universes represented by 


1 

V(27r) 



g-ia;2. 


(5-23) 


(5-24) 


the ‘first approximation to the law of error’ (for universal variance unity), obviously the 
most appropriate asymmetrical field, for different values of the parameter $\lambda_3$, and containing 
as a member of the field the normal distribution found for $\lambda_3 = 0$. For indefinitely 
large samples from (5.24) the mean value of g(c) is 


2 A 

‘^=6V(^)Jo 


e-i®® = - - 


'>^‘•¥74- 


From (5-23) and (6-26) 


5 

cr 




7-(c), 


(5-25) 

(5-26) 


the skew discriminant r(c) being given by 
Log-differentiating, 




/ 2^+1 T \-i 


T'(C) _ 

1 


1 

/ 2 / 2 C +2 

4c+24+i\ 

t(c) ■■ 

c -1 

2 ' 

l2c + l 

1 4i 

2^1 ) 


^2c+2 




4+2 log21 po+i 4+2 


4i (2c+1)2 •4+1 2c+1/ \2c4* 1 4+1 


(5*28) 


Setting c = 3 and using (5.13) and (5.15), we find that $\tau'(3) = 0$. Values of $\tau(c)$ for four 
values of c are as follows: 

     c     τ(c)        c     τ(c) 

     2     2.370       4     2.389 
     3     2.460       5     2.236 


Accordingly, for indefinitely large samples the test of asymmetry g(c) is most efficient for 
c = 3, when the test becomes the familiar $\sqrt{b_1}$. The margin in favour of this value of c, as 
compared with others in the range 2 < c < 5, is, however, quite small. 


6. Tests of kurtosis from the power function viewpoint 

It may be useful to open this section with an interpretation of the results of the previous 
section from the point of view of the type of error theory of J. Neyman & E. S. Pearson 
(1933, 1936). For this we consider two universes of the field, the normal $W_0$ and any non-normal 
universe $W_1$, and two tests of kurtosis, a(4) = $b_2$ and $a(c_1)$ for a particular value $c_1$ of 
c. Suppose that samples are sufficiently large that a(c), for samples from all universes of the 
field, may be regarded as normally distributed. 

Given a probability α, a sample number n can be found so that the mean value of $a(c_1)$ 
from $W_1$ lies exactly at, say, the upper α probability point of the distribution of $a(c_1)$ from $W_0$. 
Then from the results established in the preceding section the value of a(4) for the same sample 
of n from $W_1$ could lie beyond the α probability point of a(4) for normal samples of n. 
Suppose that the rule adopted was to regard as non-normal all samples for which a(c) 
lies beyond the normal α probability point, and suppose that a very large number N of 
samples were drawn, $N_0$ from universes not significantly different from normal (defining 
‘insignificance’ in some manner) and $N_1$ from non-normal universes, so that $N = N_0 + N_1$, 
where $N_0$ and $N_1$ are not necessarily known in advance. Then using $a(c_1)$ the number of 
erroneous allocations will be approximately $\alpha N_0 + \tfrac{1}{2}N_1$, whereas using a(4) the number will 
be $\alpha N_0 + (\tfrac{1}{2} - \epsilon)N_1$ $(\tfrac{1}{2} > \epsilon > 0)$, showing a definite advantage in favour of a(4). The same 
conclusion emerges whatever value of c ≠ 4 or whatever non-normal universe be taken 
for comparison. 
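
The counting argument above can be sketched numerically under the same large-sample assumption that the test statistic is normally distributed in both universes. The function names and all numerical inputs below are hypothetical illustrations, not values taken from the paper.

```python
from statistics import NormalDist


def upper_point(mean0, sd0, alpha):
    """Upper alpha probability point of the statistic under the normal universe."""
    return NormalDist(mean0, sd0).inv_cdf(1 - alpha)


def power(mean0, sd0, mean1, sd1, alpha):
    """Frequency with which the statistic from the alternative universe
    exceeds the normal-theory upper alpha point."""
    crit = upper_point(mean0, sd0, alpha)
    return 1 - NormalDist(mean1, sd1).cdf(crit)


def erroneous_allocations(n0, n1, alpha, pw):
    """Expected wrong decisions: false alarms among n0 normal samples
    plus missed detections among n1 non-normal samples."""
    return alpha * n0 + (1 - pw) * n1


if __name__ == "__main__":
    alpha = 0.05
    crit = upper_point(3.0, 0.5, alpha)           # hypothetical null mean and s.d.
    pw = power(3.0, 0.5, crit, 0.7, alpha)        # alternative mean placed at the point
    print(pw, erroneous_allocations(1000, 1000, alpha, pw))
```

When the alternative mean sits exactly at the critical point the power is one half, so half of the non-normal samples are misallocated; any test whose alternative mean lies beyond its own critical point does better.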

The type of error approach reveals the theoretical weakness of using the method of § 5 
for the assessment of relative efficiency of tests of normality; namely that the proportion of 
errors of judgment, even using a(4), remains large, due fundamentally to concentrating on a 
single value (the mean) as typical or representative of samples from the non-normal universe; 
it is also a disadvantage that the sample number is necessarily a function of the particular 
value $c_1$ of c. The method has further disadvantages of which the principal are perhaps (i) a 
somewhat restricted field of alternative universes; (ii) the assumption that the samples were 
indefinitely large, essential to justify the normality of a(c) for samples from any member of 
the universe field. 

The Neyman-Pearson power function approach which will now be considered cannot be 
regarded as entirely free from these objections in its application to the material so far 
available from this research. It enables us, at any rate, to contemplate samples which, if not 
small, are within the range of experimental practicability. 

The problem of the relative efficiency of the different members of a field of tests of kurtosis 
a(c) will now be considered in its power function aspects. For the present purpose the power 
may be defined as follows: 

Given a probability α (say 0.01), a sample number n, a particular value $c_1$ of c and a 
non-normal parent universe $W_1$, the power, in relation to these data, represents the frequency 
of $a(c_1)$ for samples drawn at random from $W_1$ lying beyond the α probability point for $a(c_1)$ 
computed from samples drawn from a normal universe. The greater the power the more 
discriminatory the test. Accordingly, it is in theory necessary to know the frequency 
distribution of a(c) for all sample sizes, for all values of c and for all universes. Considering that 
the only frequency distribution of the field contemplated which can be regarded as 
determined for all sample sizes is a(1) for normal samples (Geary, 1935, 1936), many compromises 
are necessary to give any kind of practical effect to the power concept. The compromises 
proposed are as follows: 

(1) The form $a_1(c)$, given by (3.2), is used instead of the form a(c) given by (3.1). 

(2) Only large samples are dealt with. 

(3) The field of alternative universes is restricted. 

Using $a_1(c)$, the first four moments (from the origin) of $a_1(c)$ for samples from any universe 
can be expanded without real difficulty, and so approximate frequency distributions (using 
the Karl Pearson or Gram-Charlier systems) can be obtained. As to (1), from experiments 
in a(1) and a(4) the writer has verified that, for medium-sized normal samples, there is little 
difference between the probability points (e.g. 0.01, 0.05) of $a_1(c)$ and a(c), though the higher 
semi-invariants (given n) are larger for the latter. In regard to (2) and (3), little confidence 
could be reposed in the values of the moments computed from expansions even to $n^{-3}$ unless 
the sample number was at least of the order of 100 when c is greater than, say, 3; and, even 
if the moments were known exactly, the empirical frequencies would be more than doubtful 
for small samples. The approach finds its main justification in the consideration that any 
errors due to these necessary compromises may be presumed to apply more or less equally 
and in the same direction to the tests of kurtosis compared; generous, perhaps too generous, 
advantage is taken of this justification in the concluding part of this section. 


Set, then, 
so that 


« = /“iciM®. Vi = (1 


(6-1) 

( 6 - 2 ) 


where 


(6.3) 
the universal mean being taken as zero, without loss of generality. Raising (6.2) to powers 
1, 2, 3, 4, expanding the final factor to the required degree, multiplying by the first factor 
on the right, and setting down the mean value of each term, we find, to $n^{-3}$, 

Milcc = l-i{^l«(H)-4«(02)} + ^ {^ii)(12)-4«[(03) + 3(ll)(02)] + 34«(02)2} 

+ ^ {4'*[3(11) (02) - (13)]+ Att)[(04) - 3(02)2 +4(11} (03) +6(12) (02)] 

~ 4^)[10(03) (02) +16(11) (02)2] + 154«(02)}, (6-4) 

= H~i{fc^®(02)~2^:f(ll) + (20)}+;i{-yt®(03) + 3/i:f(02)2 

+ 2/4®(12) - 6742>(11) (02) - 7if(2l) + ifcf|)(20) (02) + 2fc<|^(ll)2} 

+ - 3(02)*] - I0kf{02) (02) + I5)cf\02f - 2)fc®[(13) ~ 3(11) (02)] 

+ 4A;1*)[2(11) (03) + 3(12) (02)] - 30fc®(ll) (02)2 

+ 4®>[(22) - (20) (02)] - 4®[(20) (03) + 3(21) (02)] - 2i)®(ll)2 

-6lt®(12)(ll) + 127cf(ll)2(02) + 3/4*>(20)(02)2}, (6-5) 

= l+i{4®(02)-37:f(ll) + 3(20)} + ^{-7:f(03) + 3*f(02)2 
+ 3fcf (12) - 9fc®(ll) (02) - 3/cf (21) + 3/t|>(20) (02) + (11)* 

+ (30) - 3fcf (20) (11)}+4{ii:f[(04) - 3(02)*] - 10/cf (03) (02) + I5kf(02f 

71/ 

-3^f[(13)-3(ll) (02)] + 67;f[2(ll) (03)+ 3(12) (02)]-454®(ll) (02)* 

+ 3/4®[(22) - (20) (02)] - 34»[(20) (03) + 3(21) (02)] + 94»(20) (02)* 

- 64*>( 11 )* -184*>( 12) (11) + 364»)( 11)* (02) - 4*>(31) 

+ 4*)(30)(02) + 34*>(20)(ll) 

+ 34*'[(20) (12) + 2(21) (11)] - 94*>(20) (11) (02)-64*>(ll)*}, (6-6) 

M'^joc^ = l + i{4«(02)-44«(ll) + 6(20)} + ~{-4">(03) + 3/cW(02)* 

+ 4/4«(12) - 124«(11) (02) - 64«(21) + 64^)(20) (02) +124^)(11)* 

+ 4(30) - 124*)(20) (11) + 3(20)*} + -4{7;f [(04)- 3(02)*] 

71 

- mf(02) (02) -154«(02)* - ikf\{n) - 3(11) ( 02 )] 

+ 87cW[2(li) (03) + 3(12) (02)] -60fc«(ll) (02)* + 64*«[(22)- (20) (02)] 

- 6fcf[(20) (03) + 3(21) (02)]+ ISfcf(20) (02)*- 12/i:«>(ll)* 

-364«(12) (ll) + 727:f(ll)*(02)-44^>(31) + 44^>(30) (02) 

+ 124 «( 20 ) (11) + 124 ^>[( 12 )( 20 ) + 2(21)(11)] 

- 364«(20) (11) (02) - 244*>(11)* + (40) - 44«(30) (11) - 3(20)* 

- 6kf>(20) (21) + 34^>(20)* (02) + 124 *^( 20 ) (11)*), (6-7) 

]c(s) _ + + , (fg) = Eyiz^, 


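where 
the latter, of course, the same for all i. The (fg) required for the computation of (6.4)-(6.7) are 

$$
\begin{aligned}
(11) &= (\mu_{|c+2|} - \mu_{|c|}\mu_2)/(\mu_{|c|}\mu_2),\\
(02) &= (\mu_4 - \mu_2^2)/\mu_2^2,\\
(12) &= [\mu_{|c+4|} - 2\mu_{|c+2|}\mu_2 - \mu_{|c|}(\mu_4 - 2\mu_2^2)]/(\mu_{|c|}\mu_2^2),\\
(03) &= (\mu_6 - 3\mu_4\mu_2 + 2\mu_2^3)/\mu_2^3,\\
(04) &= (\mu_8 - 4\mu_6\mu_2 + 6\mu_4\mu_2^2 - 3\mu_2^4)/\mu_2^4,\\
(13) &= [\mu_{|c+6|} - 3\mu_{|c+4|}\mu_2 + 3\mu_{|c+2|}\mu_2^2 - \mu_{|c|}(\mu_6 - 3\mu_4\mu_2 + 3\mu_2^3)]/(\mu_{|c|}\mu_2^3),\\
(21) &= [\mu_{|2c+2|} - 2\mu_{|c+2|}\mu_{|c|} - \mu_2(\mu_{|2c|} - 2\mu_{|c|}^2)]/(\mu_{|c|}^2\mu_2),\\
(22) &= (\mu_{|2c+4|} - 2\mu_{|2c+2|}\mu_2 + \mu_{|2c|}\mu_2^2 - 2\mu_{|c+4|}\mu_{|c|} + 4\mu_{|c+2|}\mu_{|c|}\mu_2 - 3\mu_{|c|}^2\mu_2^2 + \mu_{|c|}^2\mu_4)/(\mu_{|c|}^2\mu_2^2),\\
(20) &= (\mu_{|2c|} - \mu_{|c|}^2)/\mu_{|c|}^2,\\
(30) &= (\mu_{|3c|} - 3\mu_{|2c|}\mu_{|c|} + 2\mu_{|c|}^3)/\mu_{|c|}^3,\\
(31) &= (\mu_{|3c+2|} - 3\mu_{|2c+2|}\mu_{|c|} + 3\mu_{|c+2|}\mu_{|c|}^2 - \mu_{|3c|}\mu_2 + 3\mu_{|2c|}\mu_{|c|}\mu_2 - 3\mu_{|c|}^3\mu_2)/(\mu_{|c|}^3\mu_2),\\
(40) &= (\mu_{|4c|} - 4\mu_{|3c|}\mu_{|c|} + 6\mu_{|2c|}\mu_{|c|}^2 - 3\mu_{|c|}^4)/\mu_{|c|}^4.
\end{aligned}
\qquad (6.8)
$$

(6.8) is, of course, an immediate consequence of (6.3). The writer has checked the accuracy 
of formulae (6.4)-(6.7) by reference to the normal universe for c = 1. 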
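
The quantities (fg) can be checked numerically. The sketch below rests on two assumptions that are not legible in the source as it stands: that (fg) is the mean value of $y^f z^g$ with $y = |x|^c/\mu_{|c|} - 1$ and $z = x^2/\mu_2 - 1$, and that the symmetrical alternative universe has density $\phi(x)\{1 + \lambda_4(x^4 - 6x^2 + 3)/24\}$. The function names are illustrative.

```python
from math import comb, gamma, pi, sqrt


def normal_abs_moment(r):
    """E|x|^r for a standard normal variate."""
    return 2 ** (r / 2) * gamma((r + 1) / 2) / sqrt(pi)


def abs_moment(r, lam4=0.0):
    """E|x|^r under the assumed density phi(x) * [1 + lam4/24 * (x^4 - 6x^2 + 3)]."""
    base = normal_abs_moment(r)
    correction = normal_abs_moment(r + 4) - 6 * normal_abs_moment(r + 2) + 3 * base
    return base + lam4 / 24 * correction


def fg(f, g, c, lam4=0.0):
    """(fg) = E[y^f z^g] with y = |x|^c / mu_|c| - 1 and z = x^2 / mu_2 - 1."""
    mu_c = abs_moment(c, lam4)
    mu_2 = abs_moment(2, lam4)
    total = 0.0
    for j in range(f + 1):          # binomial expansion of y^f
        for k in range(g + 1):      # binomial expansion of z^g
            sign = (-1) ** ((f - j) + (g - k))
            term = abs_moment(c * j + 2 * k, lam4) / (mu_c ** j * mu_2 ** k)
            total += sign * comb(f, j) * comb(g, k) * term
    return total


if __name__ == "__main__":
    for f, g in [(1, 1), (0, 2), (2, 0), (3, 0), (4, 0)]:
        print((f, g), fg(f, g, 4), fg(f, g, 1))
```

Under these assumptions the normal universe gives (11) = 4, (20) = 32/3 and (40) = 23,552 for c = 4, and (20) = π/2 - 1 for c = 1, in agreement with the tabulated normal values.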

The reader will have no illusions as to the magnitude of the task of applying the foregoing 
theory to particular cases. The formulae are set down, however, in the hope that other 
researchers will be sufficiently sensible of the importance of the theory to assist in building 
up a fairly extensive set of results. The writer has to be content, in the meantime, to consider 
the case of the symmetrical universe field given by 



when $\lambda_4 = \tfrac{1}{2}$, the normal being given, of course, for $\lambda_4 = 0$, and for c = 4 and c = 1. These 
values of c are selected because the theory in § 5 has suggested that a(4) is probably the most 
efficient of the test-field a(c), while a(1) is the only member of the field for which the normal 
distribution is known for samples of all sizes. The necessary moments (fg) given by (6.8) 
are shown in Table 8. Based on the values in this table, moments (M') given by (6.4)-(6.7) 
of $a_1(c)$ and semi-invariants (L) derived therefrom are as follows. The normal values are, 
of course, known exactly but were computed for the purpose of checking the formulae: 


Table 8. Moments from formulae (6.8) 

                     c = 4                          c = 1 
   (fg)     Normal       λ4 = 1/2          Normal       λ4 = 1/2 

   (11)     4            5.428571          1            1.17021276 
   (02)     2            2.5               2            2.5 
   (12)     24           45.64286          3            4.88297871 
   (03)     8            14                8            14 
   (04)     60           138               60           138 
   (13)     216          544.2857          21           44.106383 
   (21)     256/3        177.71428         1.141593     1.75544898 
   (22)     2,720/3      2,481.92857       7.707963     14.766814 
   (20)     32/3         16.142857         0.570796     0.63834981 
   (30)     352          799.142857        0.429204     0.6405182 
   (31)     4,352        12,785.2653       3            5.236134 
   (40)     23,552       73,250.178                     2.002492 


c = 4; normal universe, 

3 3 n'^ n®’ 

2 48 1040- ^3^64 2368 

21^ '^n a® m® ’ 27“^ 

M'i^ 8 40 3520 1^4 _ 3840 

81 n® ’ 81 ~ 71 ® ' 


' c = 4; universal A 4 = |- 

3-357 11-822 12-1 

3-5 3-5 n Ti.® 71® ’ 


(3-5)® 

(3-6)® 

Ml 

(3-5)-* 


_ ^ 2-286 57-34 ^ 776-03 


n 77 ,® 77 “ 

3-215 107-47 2853-89 




— 1 + 


(3*5)2 

£3 


3 > 


4*4286 92*25 831*2 

r--- -- -]-;- 

n rv 

144*61 6193*95 


n 




w 


(3-5)® 


n‘ 


2 




3 


13*143 20*49 9529 
,1 + -+ 


10,587 


n 




3 f 


(3*6)^ 




0 ^ 1 ; normal universe, 


^ 0*19947114 0*02493389 0*03116737 

^ 0*7978845608 +-+ - 


n 






0*04507034 0*07957747 ^ 0*03978874 


n 






0*01685646 0*07613597 
71® ^ n® ■ 




/^lll /^H! 
^--1 + 


1 + 


c = 1 ; universal = | 
0*35239362 0*159616 0*745838 


n 




ifj . 0*79792429 0*458012 1*800648 0*09313705 0*262961 0*196477 


/^fil 


n 




'nr 


/^fif 


n 




n^ 


/^ll! 


a + 


1*336592 0*850081 3*239101 


n 




/4i\ 


0*053366 0*204164 

- +— - 






$\mu_{|1|} = 0.78126197$. 

Two sample sizes were considered: n = 100 and n = 500. For n = 100 and c = 4, the 
following are the Pearson Type IV frequencies of $a_1(4)$ when the parent universes are normal 
and have $\lambda_4 = \tfrac{1}{2}$ respectively: 


Normal: A4 = 0. /c cos^^’^®®°0 dx^ 

tan0 = (cr---l-873387)/0*765849, • 
logiQ/c — 3*2644596. 


( 6 - 10 ) 


A4 = 

tanl9 = (a;-2*8622)/0‘9062, 
logio/c = T‘7499974. 


( 6 - 11 ) 


The normal probability points shown in column (2) of Table 9 were derived from the foregoing 
normal frequency (6.10); the points in column (3) were derived from a Gram-Charlier 
formula (Geary, 1935). The 0.01 and 0.05 points given in column (2) are practically identical 
with those given by E. S. Pearson (1929) for a(4), namely, 4.39 and 3.77. The powers given in 
column (4) are the aggregate frequencies lying beyond the values of the variate shown in 
column (2) on the assumption that the actual frequency was (6.11). The corresponding 
figures for c = 1 given in column (5) were based on a Gram-Charlier formula. 


Table 9. Power of $a_1(c)$ for c = 4 and c = 1 of discriminating (6.9) for $\lambda_4 = \tfrac{1}{2}$ from 
the normal ($\lambda_4 = 0$) at four normal theory probability levels. Samples of 100 

   Normal theory     Normal theory probability points     Power for frequency (6.9) with λ4 = 1/2 
   probability       c = 4 (upper)     c = 1 (lower)      c = 4          c = 1 
       (1)               (2)               (3)             (4)            (5) 

      0.01             4.3836            0.7482          0.0648         0.0696 
      0.05             3.7744            0.7642          0.1996         0.1979 
      0.10             3.6196            0.7726          0.3163         0.3037 
      0.20             3.3110            0.7824          0.4626         0.4697 


Before discussing the comparative powers in Table 9 it will be convenient to give a 
table, Table 10, on the same lines but for n = 500. On account of the larger sample size it has been 
necessary to change the reference probabilities given in column (1). For the construction 
of this table Gram-Charlier formulae were used throughout, the probability points being 
determined from the E. A. Cornish & R. A. Fisher (1937) formulae, after verifying that 
for two of the probability levels, 0.01 and 0.05, the probability points for c = 4 (column (2) 
of that table) did not differ appreciably from those given by E. S. Pearson, namely, 3.60 and 3.37 
(for a(4)), based on a Type IV curve. 
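
The way a probability point is obtained from the first few semi-invariants can be sketched with the leading terms of the Cornish-Fisher expansion. This is a generic illustration under stated assumptions: the inputs are the mean, standard deviation, skewness and excess kurtosis of the statistic, the function name is hypothetical, and the numerical inputs are not the paper's values.

```python
from statistics import NormalDist


def cornish_fisher_point(p, mean, sd, skew=0.0, excess_kurtosis=0.0):
    """Approximate the p-quantile of a near-normal variate from its first four
    semi-invariants, using the leading Cornish-Fisher correction terms."""
    z = NormalDist().inv_cdf(p)
    w = (z
         + (z ** 2 - 1) * skew / 6
         + (z ** 3 - 3 * z) * excess_kurtosis / 24
         - (2 * z ** 3 - 5 * z) * skew ** 2 / 36)
    return mean + sd * w


if __name__ == "__main__":
    # Hypothetical semi-invariants, for illustration only.
    print(cornish_fisher_point(0.99, mean=3.0, sd=0.2, skew=0.5, excess_kurtosis=0.4))
```

With zero skewness and zero excess kurtosis the result reduces to the ordinary normal probability point; positive skewness pushes the upper point outwards.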

The analysis in § 5 has enabled us to come fairly firmly to the conclusion that for indefinitely 
large samples a(4) was to be preferred to a(1) as a test of normality. We see from Tables 9 
and 10 that this is subject to an important qualification. Table 10 shows that the discriminating 
power is definitely greater for samples of 500 for a(4) than for a(1), but the superiority 
is less emphatic than might have been anticipated from § 5. For medium-sized samples 
(Table 9) a(4) exhibits no superiority. Of course, these conclusions are very tentative, 
as being based upon a single alternative and on particular sample sizes. The writer had 
proposed, in addition, to examine the universes (i) $\lambda_3 = 0$, $\lambda_4 = 1$ and (ii) $\lambda_3^2 = \lambda_4 = \tfrac{1}{2}$ as 
alternatives to the normal but time did not permit; he ventures to repeat the hope that other 
students will take the matter up. 

Table 10. Power of $a_1(c)$ for c = 4 and c = 1 of discriminating (6.9) for $\lambda_4 = \tfrac{1}{2}$ 
from the normal ($\lambda_4 = 0$) at four probability levels. Samples of 500 

   Normal            Normal probability points            Power for frequency (6.9) with λ4 = 1/2 
   probability       c = 4 (upper)     c = 1 (lower)      c = 4          c = 1 
       (1)               (2)               (3)             (4)            (5) 

      0.005            3.7062            0.773167        0.1934         0.2067 
      0.01             3.6094            0.776684        0.2920         0.2790 
      0.05             3.3766            0.782482        0.6966         0.6196 
      0.10             3.2695            0.786058        0.7392         0.6609 


7. Conclusion and summary 

In § 2 of the present paper it is shown that the actual probability of differences between 
means and variances derived from random samples on the null hypothesis may differ 
considerably from the probability derived from the standard tables (compiled on the 
assumption that the universal distribution is normal) when, in fact, the universal distribution 
is not normal. Accordingly, the standard tables cannot validly be used unless tests, 
based on the sample from which the inferences are to be drawn, or on a series of samples 
produced under similar conditions, have established the likelihood that the universal 
distribution is approximately normal. In certain cases (but these must be few) the nature 
of the material may, of itself, suffice to justify the assumption of universal normality. 
When universal normality cannot be assumed, the best course will be to correct the standard 
tables using, for this purpose, the moments (up to, say, the fourth) derived from the sample, 
in conjunction with the formulae given in § 2. This procedure is, of course, open to the objection 
that the moments derived from the sample may, in fact, differ substantially from the (in 
general unknown) universal moments, so that any probabilistic inference derived using 
sample moments must be accepted with reserve. If $b_2 = 3.5$, say, it would be safer to assume 
that the universal value $\beta_2$ is 3.5 than to hope (without other evidence) that it is 3, the 
normal value; it might be 3.75 or even 4, when, usually, the standard table probabilities 
will be still further astray. It should not be difficult to construct supplementary tables 
giving very approximate corrections of the standard tables, using the moment expansions 
given in § 2, for different values of $\beta_1$ and $\beta_2$. To compute unbiassed estimates of the latter, 
R. A. Fisher’s k statistics (1929) should, of course, be used. 

It may be asked if testing for normality and, when necessary, correction for universal 
non-normality is worth the trouble. To answer this question it is desirable to have regard to 
the logical position of the statistician, concerned with drawing inferences from samples, 
whose characteristic approach may be defined as reductio ad paene absurdum: if an event is 
highly improbable it must be regarded for practical purposes as impossible. St Thomas 
Aquinas’s* famous ‘certitude of probability’ is peculiarly apt as applied to the mental 
attitude of the statistician, from two quite different viewpoints. The first is that decision, 
and action based on that decision, for which there is not certainty, but merely probabilistic 
preference, is absolute. One does not say that one has a preference of 20 to 1 for Fertilizer A 
over Fertilizer B because the difference between the yields is at or near the 5 % probability 
point of some test function: one necessarily decides without qualification that A is better 
than B. 

* ‘According to the Philosopher, certitude is not to be sought equally in every matter. ... Hence 
the certitude of probability suffices, such as may reach the truth in the greater number of cases, although 
it fails in the minority’ (Summa IIa-IIae, q. lxx, a. 2). 

The second aspect, which has the greater relevance in the present case, is that the statistician 
regards himself as endowed with ‘certitude’ when he knows that if he repeated an 
experiment, as to, say, significant differences in averages, a great number of times, he would 
be in error in attributing significant difference when, in fact, there was none, in a predetermined 
proportion of cases. He has certitude as to the probability though his decision in the 
individual case may be wrong. What is curious is that decisions (which, in effect, are absolute) 
can be based on probability levels which vary with the temperament of the statistician from 
perhaps a conservative 0.001 to a daring 0.1. For the particular statistician the probability 
level will vary with the case: for instance, the present writer would be inclined to suspect 
non-normality near the 10 % probability level of the a(1) table, whereas he would not be 
disposed to attach significance in, say, analysis of variance, until about the 2½ % level. 
Naturally the level will depend on the importance attaching to the decision. 

Since all the statistician usually requires from the table of probability for a given measure 
of significance is whether, on the null hypothesis, the probability is ‘small’, absolute 
precision is not necessary in the probability. If the probability is thought to be minute, say 
0.001, it does not matter if in actual fact it is 0.002 or 0.0005. If, on the contrary, the standard 
table value is approaching the statistician’s level of decision it surely matters a great deal: 
if he thinks his judgment is likely to be erroneous in 1 out of 20 experiments it must be of 
importance if, in fact, the true probability is something like 1 in 10 or 1 in 5. These are the 
kinds of contrasts that appear from § 2, from comparison of standard table probabilities 
with ‘actual’ probabilities found when the samples were assumed to be randomly drawn 
from certain arbitrarily selected types of non-normal universes. The computed probabilities 
in § 2 admittedly make no claim to exactitude in most of the cases, since the formulae were 
strained by their application to small sample theory. The point is, however, that the estimates 
of the actual probabilities are unbiassed in regard to the ‘normal theory’ probabilities: 
if the former could be closer to the latter, they might also be further away. 

There is one case which is in a quite exceptional category, namely that considered at the 
beginning of § 2. As far as the writer is aware, this case has never been examined theoretically 
before, despite the extreme simplicity of the algebra. It is shown that in the simplest case 
of analysis of variance, when the two sample numbers are of the same order of magnitude, 
the variance is proportional, approximately, to that quite a small measure of 

universal kurtosis materially changes the probability. Statisticians must have been affected 
by a kind of hypnosis in favour of normal theory to have overlooked so trivial a point, 
a stricture from which the writer is not particularly concerned to exclude himself! An 
exception was E. S. Pearson (1931) who, on the basis of his results cited in § 2 (a), sounded 
a warning: ‘The illustration should serve to emphasize the fact that certain of the “normal 
theory” tests can be used with greater confidence than others when dealing with samples 
from populations whose distribution laws are not known.’ 

An interesting chapter could be written on the fluctuations in the attitude of statisticians 
during the past century on the question of the occurrence of the normal frequency distribu¬ 
tion in nature, a chapter, perhaps, in a large work on Fashions in the Sciences down the Ages. 
Amongst the following the historian may find the reasons for the prejudice in favour of the 
hypothesis of universal normality up to, say, the end of the last century: 

(1) The fact that, to a close approximation, it applies in a wide range of mathematical 
conditions. 

(2) The fact that the theory found practical applications predominantly in assessing the 
probability of errors in astronomical measurements and in games of chance where the 
mathematical model could reasonably be assumed to apply. 

(3) The beauty of the mathematical theory and the facility of algebraic manipulation in 
the function involved. 

(4) The general shape to the visual sense of such frequency distributions as were known, 
before imposed its discipline. 

With the development, about the beginning of the century, of the theory of moments, 
statisticians became almost over-conscious of universal non-normality. The concomitant 
semi-invariant approach had quite a different background. The difference between the 
moment and Karl Pearson curve system on the one hand and semi-invariants and the Gram- 
Charlier system on the other is fundamentally that for the former normality is a particular 
case like any other, whereas for the latter normality is basic and generative. Each system 
has its advantages and disadvantages as applied to the determination of frequency 
distributions of which the lower moments are known. In fanciful terms one might say that in 
the ship Gram-Charlier one might sail in perfect safety but only within limited, and more 
or less ascertainable, range of Port Normality, whereas in the good craft Pearson one can 
sail the seven seas, at one’s own risk.* 

Our historian will find a significant change of attitude about a quarter-century ago following 
on the brilliant work of R. A. Fisher who showed that, when universal normality could be 
assumed, inferences of the widest practical usefulness could be drawn from samples of any 
size. Prejudice in favour of normality returned in full force and interest in non-normality 
receded to the background (though one of the finest contributions to non-normal theory 
was made during the period by R. A. Fisher himself), and the importance of the underlying 
assumptions was almost forgotten. Even the few workers in the field (amongst them the 
present writer) seemed concerned to show that ‘universal non-normality doesn’t matter’: 
we so wanted to find the theory as good as it was beautiful. References (when there were 
any at all) in the text-books to the basic assumptions were perfunctory in the extreme. 
Amends might be made in the interest of the new generation of students by printing in 
leaded type in future editions of existing text-books and in all new text-books: 

Normality is a myth; there never was, and never will be, a normal distribution. 

This is an over-statement from the practical point of view, but it represents a safer initial 
mental attitude than any in fashion during the past two decades. 

As already indicated, the present work is incomplete, especially on the experimental side. 
The writer hopes that he has created a prima facie case for the importance of testing for 
normality. 

* This comment must not be taken as applying to the problem of curve-fitting, i.e. to fitting a smooth 
curve to given frequencies, but to the problem of estimating the frequency function given the first 
few semi-invariants. 

Summary 

(i) Inferences drawn from the standard (normal) tables of z and t may be seriously in 
error if the conditions in which the standard tables apply (the principal of which is that the 
universes from which the samples are drawn are normal) are ignored. 

(ii) Sufficient conditions are given for the approach to normality, with increasing sample 
size, of the field of tests of normality a(c) (given by (3.1)) for c > 0. 

(iii) Many-term expansions of the first four moments of a(c) for normal samples are given, 
with practical applications designed to find the values of c for which the moments could 
be used with confidence to find the frequency distributions for medium-size samples; 
semi-invariants of $a_1(2.4)$ and $a_1(4)$ ($a_1(c)$ is given by (3.2)) are compared; correlations between 
$a_1(c)$ and $a_1(c')$ are examined. 

(iv) For indefinitely large samples and a wide field of alternative universes a(4) is found 
to be the most sensitive test of kurtosis, and an analogous test of asymmetry g(c) is found to 
be most sensitive for c = 3, g(3) being the familiar $\sqrt{b_1}$. 

(v) An examination of the relative efficiency of a(1) and a(4) from the Power Function 
point of view suggests that a(4) is increasingly to be preferred as the sample size increases; 
for samples of moderate size a(1) is probably as efficient as a(4). 

(vi) Throughout the paper a considerable range of formulae is given in case students may 
feel interested to carry the writer's researches a stage further so as to give a firmer basis to 
his conclusions or to modify them. It is suggested (§ 4) that the preparation of a table of 
probability points of a(2.2) for normal samples of different sizes be taken in hand. 


THE STRATIFIED SEMI-STATIONARY POPULATION 


By S. VAJDA 


1. Constant population 

Let a set of non-increasing real values $p_0 = 1, p_1, \ldots, p_n, p_{n+1} = 0$ be given, and let $p_i$ 
represent the probability of a person of age 0 surviving the i following years. Further, let 
$l_0, l_1, \ldots, l_n$ represent the numbers of persons of age 0, 1, ..., n living at time t = 0. We consider 
then the development of such a population during the years following t = 0, under the 
assumption that the probabilities $p_i$ remain the same throughout the period investigated. 
Only persons of age 0 are to enter the population, and the number of such entrants shall 
be such that the total of the population is kept constant at a number $H = \sum_{i=0}^{n} l_i$. At the end 
of the first year the survivors of the H persons who were alive at t = 0 will be (if we put 
$l_i/p_i = r_i$, say) 

$$\sum_{i=0}^{n-1} l_i\,\frac{p_{i+1}}{p_i} = \sum_{i=0}^{n-1} r_i\,p_{i+1},$$

and therefore the number of entrants at the beginning of the second year (i.e. at t = 1) is 

$$\phi_1 = H - \sum_{i=0}^{n-1} r_i\,p_{i+1}.$$

By the same argument the entrants at t = 2 will be 

$$\phi_2 = H - \phi_1 p_1 - \sum_{i=0}^{n-2} r_i\,p_{i+2},$$

and so on; generally 

$$\phi_t = H - \phi_{t-1}p_1 - \phi_{t-2}p_2 - \ldots - \phi_1 p_{t-1} - \sum_{i=0}^{n-t} r_i\,p_{i+t}, \qquad (1)$$

as long as $t \le n$, that is, as long as there are survivors of the initial population. For t > n 
we obtain 

$$H = \phi_t + \phi_{t-1}p_1 + \phi_{t-2}p_2 + \ldots + \phi_{t-n}p_n. \qquad (2)$$

We want to find an expression for which must obviously depend on 
(2) is a difference equation for the function of t and can easily be solved. For this purpose 

consider the * characteristic equation^ 

-f + • • • "b ^Pn-l (^) 

Let this equation have the roots x^, where is a fc.^~foldroot and x^ =)= We have 

then as a solution of the difference equation (2) 

where and Pi{t) = ^^1 + 06^2^ +“•found from the 

initial population, i.e. from equations of the form (1) which contain the first n numbers of 
entrants $\phi_1, \phi_2, \ldots, \phi_n$. By inspecting these equations, which are of the form ($t \le n$) 

$$H = \phi_t + \phi_{t-1}p_1 + \ldots + \phi_1 p_{t-1} + r_0 p_t + \ldots + r_{n-t}p_n,$$

we see that they are equivalent to 

£ S (S) 

Hence the $\alpha_{ij}$ can be fixed, dependent on the $r_i = l_i/p_i$ and thus on the initial population. 
We have thus proved: 

If a population with an age distribution $l_0, l_1, \ldots, l_n$ is subject to survival rates $p_i$ (i = 1, 2, ..., n), 
and if this population is kept constant by $\phi_t$ entrants of age 0 at the end of the t-th year, then $\phi_t$ 
is given by (4), where the $x_i$ are the different roots of (3), and the $\alpha_{ij}$ must be found from the set (5). 

The population after $t$ years will have the following age distribution:

$$\phi_t,\ \phi_{t-1}p_1,\ \ldots,\ \phi_{t-n}p_n.$$

It can easily be proved that, if the $p_i$ are decreasing (and not merely non-increasing), then
for all the roots of equation (3) we have $|x_i| < 1$ and that any real root must be negative.
Hence the $\phi_t$ will oscillate around their limit $\lim_{t \to \infty} \phi_t = H_l = \sum l_i / \sum p_i$. The age
distribution of the population thus tends, again through oscillations, to $H_l, H_l p_1, \ldots, H_l p_n$,
which may be called the intrinsic stationary population. Obviously, if the initial population has
already this distribution, it will not alter any more and the number of entrants will be constant
and $= H_l$. In such a case all $\alpha_{ij} = 0$, and $r_i = l_0$, whatever the $p_i$ may be.

On the other hand, if $p_i = p_{i+1}$ holds for one or more values of $i$, then we may get cycles,
and this is easily seen for the equation $x^n + x^{n-1} + \ldots + x + 1 = 0$. All roots have modulus 1,
and it depends on the initial population whether we are dealing with the stationary case or
with periodic cycles. No tendency towards an intrinsic stationary population appears in
such a case.

Example. Let us assume that we have the following probabilities of survival:

  $p_1$    $p_2$    $p_3$    $p_4$     $p_5$    $p_6$
  7/8    49/96   5/32   13/384   1/384    0

The characteristic equation can then be written

$$384x^5 + 336x^4 + 196x^3 + 60x^2 + 13x + 1 = 0,$$

which has the five different roots

$$-\tfrac{1}{8}, \qquad -\tfrac{1}{4} \pm \sqrt{-\tfrac{5}{48}}, \qquad -\tfrac{1}{8} \pm \sqrt{-\tfrac{7}{64}}.$$
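The five roots can be checked numerically. The following is only a sketch using the standard library; the helper name `poly` is introduced here for illustration and is not part of the paper.

```python
import cmath

coeffs = [384, 336, 196, 60, 13, 1]  # 384x^5 + 336x^4 + ... + 13x + 1

def poly(x):
    """Evaluate the characteristic polynomial by Horner's rule."""
    value = 0
    for c in coeffs:
        value = value * x + c
    return value

roots = [
    -1 / 8,
    -1 / 4 + cmath.sqrt(-5 / 48),
    -1 / 4 - cmath.sqrt(-5 / 48),
    -1 / 8 + cmath.sqrt(-7 / 64),
    -1 / 8 - cmath.sqrt(-7 / 64),
]
for x in roots:
    # Print the root, the residual of the polynomial, and the modulus.
    print(x, abs(poly(x)), abs(x))
```

Each residual is zero up to rounding error and each modulus is below 1, in line with the statement that decreasing $p_i$ give roots inside the unit circle.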

The initial population will be assumed to be

  $l_0$    $l_1$    $l_2$   $l_3$   $l_4$   $l_5$
  859   1269   229    50    115    56

which implies $H_l = 1000$ (approx.). Therefore the $r_i = l_i/p_i$ are

  $r_0$     $r_1$       $r_2$      $r_3$      $r_4$       $r_5$
  859   1450.38   448.86   319.14   3406.42   21420.90

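From $r_k = 1000 + \alpha_1 x_1^{-k} + \ldots + \alpha_5 x_5^{-k}$ ($k = 1, 2, \ldots, 5$) we find

  $\alpha_1$         $\alpha_2$         $\alpha_3$   $\alpha_4$   $\alpha_5$
  -70 + 60i    -70 - 60i     0      0     -1

It follows that the number of entrants in the year $t$ ($= 0, 1, 2, \ldots$) will be

$$\phi_t = 1000\left[1 + (-0.07 + 0.06i)\left(-\tfrac{1}{4} + \sqrt{-\tfrac{5}{48}}\right)^t + (-0.07 - 0.06i)\left(-\tfrac{1}{4} - \sqrt{-\tfrac{5}{48}}\right)^t - 0.001\left(-\tfrac{1}{8}\right)^t\right].$$

These numbers are given in the first row of Table 1, which shows the evolution of the whole
population.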
Table 1

  t         0     1     2     3     4     5     6     7   8 and after
  Age 0    859   996  1025   989  1001  1001   999  1000   1000
      1   1269   752   872   897   865   876   876   874    875 (= 1000 x 7/8)
      2    229   740   438   508   523   505   511   511    510 (= 1000 x 49/96)
      3     50    70   227   134   156   160   155   156    156 (= 1000 x 5/32)
      4    115    11    15    49    29    34    34    34     34 (= 1000 x 13/384)
      5     56     9     1     1     4     2     3     3      3 (= 1000 x 1/384)
  Total   2578  2578  2578  2578  2578  2578  2578  2578   2578
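The recursion behind Table 1 can be sketched as follows. This is an illustration under stated assumptions, not the author's computation: the function name `simulate` is hypothetical, and the starting figures are the rounded $l_i$ quoted in the text, so later entries may differ from the printed table by a unit.

```python
from fractions import Fraction as F

def simulate(l, p, years):
    """Yearly age distributions of a population kept at a constant total.

    l[i] is the number aged i at t = 0 and p[i] the probability that an
    entrant survives i years (p[0] = 1). Entrants of age 0 make up the losses.
    """
    total = sum(l)
    rows = [list(l)]
    for _ in range(years):
        prev = rows[-1]
        # Survivors move up one age; the oldest age group dies out.
        aged = [prev[i] * p[i + 1] / p[i] for i in range(len(p) - 1)]
        entrants = total - sum(aged)
        rows.append([entrants] + aged)
    return rows

p = [F(1), F(7, 8), F(49, 96), F(5, 32), F(13, 384), F(1, 384)]
l = [F(859), F(1269), F(229), F(50), F(115), F(56)]  # rounded figures from the text
rows = simulate(l, p, 8)
for t, row in enumerate(rows):
    print(t, [round(float(v)) for v in row])
```

Exact fractions are used so that the column totals stay at 2578 without rounding drift.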


2. Two constant populations

All this covers well-known ground.* A new problem arises, however, when we consider two
initial populations with two sets of probabilities of survival, say $p_i$ ($i = 1, 2, \ldots, n_1$) and
$p'_i$ ($i = 1, 2, \ldots, n_2$), where $p_0 = p'_0 = 1$ and $p_{n_1} \neq 0$, $p'_{n_2} \neq 0$. We ask now whether it is
possible to keep both constant by the same number of yearly entrants. More precisely:

Let the two equations

$$\sum_{i=0}^{n_1} p_i\,x^{n_1-i} = 0 \quad\text{and}\quad \sum_{i=0}^{n_2} p'_i\,y^{n_2-i} = 0$$

have the roots $x_1, \ldots, x_r$ with multiplicities $k_1, \ldots, k_r$ and $y_1, \ldots, y_s$ with multiplicities $j_1, \ldots, j_s$
respectively. No two $x_i$ or two $y_i$ are equal and no $x_i$ or $y_i$ is zero. Under what further
conditions, concerning the $x$'s and the $y$'s, can the expressions $\phi_t$ and $\phi'_t$ then have the same
numerical values for all integral values of $t$, i.e.

$$(H_l - H_{l'}) + \sum_{i=1}^{r} P_i(t)\,x_i^t - \sum_{i=1}^{s} P'_i(t)\,y_i^t = 0 \quad\text{for } t = 0, 1, 2, \ldots, \qquad (6)$$

where

$$\phi_t = H_l + \sum_{i=1}^{r} P_i(t)\,x_i^t \quad\text{with}\quad P_i(t) = \alpha_{i1} + \alpha_{i2}t + \ldots + \alpha_{ik_i}t^{k_i-1} \qquad (7)$$

and

$$\phi'_t = H_{l'} + \sum_{i=1}^{s} P'_i(t)\,y_i^t \quad\text{with}\quad P'_i(t) = \beta_{i1} + \beta_{i2}t + \ldots + \beta_{ij_i}t^{j_i-1}\,? \qquad (8)$$

Suppose first that none of the $x$'s equals any of the $y$'s. Then it is known that the
determinant of any set of equations of the system (6) is not zero. It follows that we must have
$H_l = H_{l'}$ and all $\alpha$'s and $\beta$'s $= 0$, hence all $P_i(t)$ and $P'_i(t) = 0$. In this case the two populations
must already be stationary and therefore identical with the intrinsic stationary populations
which are implied by the sets $p_i$ and $p'_i$, respectively.

On the other hand, if some of the $x$'s are equal to some of the $y$'s, say $x_1 = y_1, \ldots, x_m = y_m$,
and all the others are different, then we find by the same argument that $H_l = H_{l'}$ and $P_i = P'_i$
for the first $m$ values of $i$, whereas all the other $P_i$ and $P'_i$ are identically zero. (It is, of course,
again possible that all the $P_i$ and $P'_i$ are identically zero and that we have, in fact, again the
two intrinsic stationary populations.)

If all $x$'s are equal to the $y$'s, with equal multiplicities, then the two equations are equal
and the two populations must be identical, if they are to be kept constant by equal numbers
of entrants.

* It follows, for example, from results of P. H. Leslie (1945).

We have thus reached the following conclusion: If we assume that the two equations given
above are not identical, and that the initial populations are not the intrinsic stationary ones,
then they must be such that the two equations have some (but not all) roots equal and if we
calculate the corresponding $P_i$ and $P'_i$ (see (4) and (5) of the previous section), then those
corresponding to the equal roots must be identical and the others must vanish. This includes
the case where the $x$'s and $y$'s are the same, but with different multiplicities, so that the
$P_i$ and $P'_i$ do not all extend to the highest power of $t$ which would be admissible by (7) or (8)
respectively.

If the two populations are the intrinsic stationary ones, then the numbers of entrants will
be constant (i.e. independent of the year) and the two constants will be equal if and only if

$$\frac{\sum l_i}{\sum p_i} = \frac{\sum l'_i}{\sum p'_i},$$

where the first expression refers to the first and the second expression to the second
population.

Example. In the example used in § 1 we have $\alpha_3 = \alpha_4 = 0$, and we can therefore try to
obtain a second population which is kept constant by the same numbers of entrants as the
first one. We construct an equation which has again the roots $-\tfrac{1}{4} \pm \sqrt{-\tfrac{5}{48}}$ and $-\tfrac{1}{8}$, but not
$-\tfrac{1}{8} \pm \sqrt{-\tfrac{7}{64}}$. Such an equation is, for example,

$$480x^5 + 396x^4 + 218x^3 + 62x^2 + 13x + 1 = 0,$$

which has the roots $-\tfrac{1}{8}$, $-\tfrac{1}{4} \pm \sqrt{-\tfrac{5}{48}}$ and $-\tfrac{1}{10} \pm \sqrt{-\tfrac{9}{100}}$. This implies that the probabilities
of survival, i.e. $p'_1, \ldots, p'_5$, are

  33/40   109/240   31/240   13/480   1/480,

and as the $\phi_t$ (and the $r_i = \phi_{-i}$) are to be the same as in § 1, the initial population must now be
$l'_i = r_i p'_i$. Table 2 shows the development of such a population, and it will be seen that the
first line is identical with that in Table 1.


Table 2

  t         0     1     2     3     4     5     6     7   8 and after
  Age 0    859   996  1025   989  1001  1001   999  1000   1000
      1   1196   709   822   845   816   826   826   824    825 (= 1000 x 33/40)
      2    204   658   390   452   465   449   455   455    454 (= 1000 x 109/240)
      3     41    58   187   111   129   132   128   129    129 (= 1000 x 31/240)
      4     92     9    12    39    23    27    27    27     27 (= 1000 x 13/480)
      5     45     7     1     1     3     2     2     2      2 (= 1000 x 1/480)
  Total   2437  2437  2437  2437  2437  2437  2437  2437   2437
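The construction of the second equation from the shared roots can be sketched by multiplying integer factors. The factorisation used below, $(8x+1)(6x^2+3x+1)$ for the common roots with $8x^2+2x+1$ or $10x^2+2x+1$ as the remaining quadratic, is an assumption inferred from the roots quoted in the text; the helper `polymul` is hypothetical.

```python
from fractions import Fraction as F

def polymul(a, b):
    """Multiply two polynomials given as coefficient lists, highest power first."""
    out = [0] * (len(a) + len(b) - 1)
    for i, x in enumerate(a):
        for j, y in enumerate(b):
            out[i + j] += x * y
    return out

common = polymul([8, 1], [6, 3, 1])      # roots -1/8 and -1/4 +/- sqrt(-5/48)
first = polymul(common, [8, 2, 1])       # adds -1/8 +/- sqrt(-7/64)
second = polymul(common, [10, 2, 1])     # adds -1/10 +/- sqrt(-9/100)

# Dividing by the leading coefficient gives the survival probabilities.
p_first = [F(c, first[0]) for c in first]
p_second = [F(c, second[0]) for c in second]
print(first, second)
print(p_second)
```

The two products reproduce the characteristic equations of § 1 and § 2, and `p_second` gives the survival probabilities of the second population.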
3. Stratified population: two grades

The results of the previous sections will now be used for an investigation of the stratified
population.* First, we consider a population split into a lower and a higher grade in the
following way:

We assume that all members of age 0 are in the lower grade only, but that all other ages
may share in both grades. Apart from mortality, which operates on all members according
to their age, we assume that at every age a certain proportion dependent on that age is
'promoted', at the end of the year, from the lower into the higher grade. Our problem is
to discover whether this can be done whilst maintaining the totals in both grades constant;
naturally the grand total of the population must remain constant.

It is sufficient to deal only with the lower grade, as the numbers at each age in the higher
one can be found by subtracting those in the lower grade from the total population at that
age. Now the lower grade is depleted by mortality and also by promotions. If the probability
of remaining unpromoted until age $i$ is $t_i$, then the probability of not leaving the grade in
this period is $p_i t_i = p'_i$, say. Since all entrants into the population are at the same time
entrants into the lower grade, our problem thus reduces to the following:

Is it possible to find an initial population, stratified into two grades, such that, on the
basis of mortality described by $p_i$, the number of entrants every year necessary to keep the
population constant is the same as that calculated on the basis of mortality-cum-promotion,
described by $p'_i$?

We can apply our results in § 2 to this case by considering the lower grade and the total
population as the two populations given. It follows that the lower grade can only be kept
constant by that number of entrants which is necessary for the total population, if the latter
is initially such that some of the $P_i(t)$ which depend on it are either identically zero or at
least do not extend to the highest degree indicated by the multiplicities of the corresponding
roots in $\sum p_i x^{n-i} = 0$. In order to find a suitable initial population for the lower grade it is
then necessary to find an equation $\sum p'_i y^{n'-i} = 0$ which has the roots, with the necessary
multiplicities, which appear explicitly in $\phi_t$ as calculated from the original equation, but
which is not identical with it. The degree of $\sum p'_i y^{n'-i} = 0$ may be lower than or equal to that
of $\sum p_i x^{n-i} = 0$. If it is lower, then all members of the population will be in the higher grade
at the highest age or ages.

This condition is not sufficient, however. In view of the interpretation of the equation
containing the $p'_i$ these coefficients must be positive and, as the lower grade is a part of the
whole, we must have $p'_i \le p_i$ for all $i$. But it is not necessary that we have also $p'_{i+1}/p_{i+1} \le p'_i/p_i$.
If the opposite holds, this could still bear a practical interpretation. It would mean that
reversions occur from the higher into the lower grade.

If an equation with the necessary and sufficient properties can be found, then we take the
$r_i = l_i/p_i$ which we had to start with and construct the initial population of the lower grade
by writing the number at age $i$ as $l'_i = r_i p'_i = l_i p'_i/p_i$.

It will be seen that in such a population the age distributions change with the passage of
time (tending to a stationary limit) but that nevertheless all entrants have the same combined
prospects of survival and promotion. (Thus from the point of view of a member of
the community his position is the same as if he entered a stationary population. His chances
of promotion are unaffected by the changes in the age distribution of those in front of him.
But the characteristics of the population as a whole, for instance the efficiency of the staff
from the point of view of an employer may, of course, vary considerably.) Such a population
will be called semi-stationary.

* Cf., for the stationary case, with continuous changes, H. L. Seal (1945).

Example. The population shown in Table 2 can be taken as representing a lower grade
within the population given in Table 1. The ratios $t_i = p'_i/p_i$ are then:

  $t_0$   $t_1$     $t_2$      $t_3$    $t_4$   $t_5$
   1    33/35   218/245   62/75   4/5   4/5

Table 3 is constructed by subtracting Table 2 from Table 1 and thus shows the composition
of the higher grade.

Table 3

  t        0    1    2    3    4    5    6    7   8 and after
  Age 1    73   43   50   52   49   50   50   50   50
      2    25   82   48   56   58   56   56   56   56
      3     9   12   40   23   27   28   27   27   27
      4    23    2    3   10    6    7    7    7    7
      5    11    2    -    -    1    -    1    1    1
  Total   141  141  141  141  141  141  141  141  141
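As a small illustration, the ratios $t_i = p'_i/p_i$ and the share of the lower grade promoted at the end of each year of age can be computed with exact fractions. The list name `promoted` and the formula $1 - t_{i+1}/t_i$ for the yearly share are assumptions made for this sketch, following the definition of $t_i$ as the probability of remaining unpromoted until age $i$.

```python
from fractions import Fraction as F

p = [F(1), F(7, 8), F(49, 96), F(5, 32), F(13, 384), F(1, 384)]
p_lower = [F(1), F(33, 40), F(109, 240), F(31, 240), F(13, 480), F(1, 480)]

# t[i]: probability of still being unpromoted at age i.
t = [b / a for a, b in zip(p, p_lower)]
# Share of the lower grade at age i promoted at the end of that year.
promoted = [1 - t[i + 1] / t[i] for i in range(len(t) - 1)]
print(t)
print(promoted)
```

All shares lie between 0 and 1 here, so this example needs no reversions from the higher into the lower grade.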


4. Stratified population: more than two grades

Let us now split up the higher grade as well. We have then, say, $h$ grades, with grades 2 and
above forming the aggregate which was simply called the higher grade in § 3; grade 1 is
identical with the lower grade of that section.

We assume further that promotions from any grade into the next higher one take place
at the end of every year and that every promotee into any grade has to stay there for at least
one year. Thus in any population the lowest possible age of grade $g$ is $g - 1$. The actual lowest
ages may be different, because the first promotion rates different from 0 may concern higher
ages than these. The rates of promotion can be different from grade to grade, but depend
within each grade only on the age, as before.

We shall again investigate whether it is possible to keep the total numbers of every grade
constant, even if the age distributions of the grades are changing.

We have seen that the age distribution of the total population, after $t$ years, is

$$\phi_t,\ \phi_{t-1}p_1,\ \ldots,\ \phi_{t-n}p_n.$$

The distribution of grade 1 is, at the same time,

$$\phi_t,\ \phi_{t-1}p'_1,\ \ldots,\ \phi_{t-n}p'_n,$$

and it is assumed that the set of $p'_i$ is not identical with the set of $p_i$. Hence grades 2 and above
will have the age distribution

$$\phi_t(p_0 - p'_0),\ \phi_{t-1}(p_1 - p'_1),\ \ldots,\ \phi_{t-n}(p_n - p'_n).$$

Let us assume that $\phi_{t-v}(p_v - p'_v)$ is the first item in this series which is not zero. Clearly we
have $v \ge 1$. Then, as far as numbers of members (and not their individual careers) are
concerned, this aggregate of grades 2 and above is equivalent to a population which has arisen
from successive annual entrants $\phi_{t-v}(p_v - p'_v)$ who have been subject to rates of survival

$$q_1 = \frac{p_{v+1} - p'_{v+1}}{p_v - p'_v},\ \ldots,\ q_{n-v} = \frac{p_n - p'_n}{p_v - p'_v}.$$

It must be understood, however, that 'survival' is here a balance between deaths and
promotions into the grade, so that these rates may very well exceed unity.

The number of annual entrants into grade 2 is given by

$$\phi_{t-v}(p_v - p'_v) = (p_v - p'_v)\Bigl[H_l + \sum_i P_i(t-v)\,x_i^{t-v}\Bigr],$$

where the $x_i$ are the common roots of $\sum_{i=0}^{n} p_i x^{n-i} = 0$ and $\sum_{i=0}^{n} p'_i x^{n-i} = 0$, with multiplicities
$k_i$ and $j_i$ respectively, and where the $P_i$ are polynomials whose order does not exceed either
$k_i - 1$ or $j_i - 1$. (They may all be identically zero.)

The $x_i$ are, of course, also roots of $\sum_{i=v}^{n} (p_i - p'_i)\,x^{n-i} = 0$, with multiplicities given by the
smaller of $k_i$ and $j_i$.

We ask now if it is possible to construct grade 2 alone in such a way that its total remains
also constant. The argument which has been used in § 3 shows that this is possible if another
equation of degree $n - v$ can be found whose coefficients $u_i$, say, are not larger than the
corresponding $q_i$ (and $u_0 = 1$), which has once again the roots $x_i$ with multiplicities $g_i$ at
least. If $\sum g_i = n - v$, then this is clearly impossible. If $\sum g_i$ is smaller than
this value, then we can try to find such an equation. The initial population can also be then
found, if we multiply the initial population of grades 2 and above by $u_i/q_i$. The grades 1, 2
and the aggregate of 3 and above can then be constructed and every stratum kept constant,
but with changing age distributions.

We can proceed in the same way and find at each step whether further splitting up is
possible beyond 3 grades, 4 grades, etc. It is seen that in general, if $\sum g_i = n - m$, and if
grade $g$ starts in fact at age $g - 1$, then $m + 1$ grades can exist.

The smallest value of $\sum g_i$ is 1, and in this extreme case $n$ grades can be constructed, i.e.
one less than the number of ages. The $n$th grade will then contain the ages $n - 1$ and $n$.
Further, since is a root of a; — aii = 0 , the age distribution of this highest grade is 

Hi -h- (Hi 4- Xi- -HiXi- 


is, of course, negative). 

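Example. We use again the same example as before. The characteristic equation for the
whole population was

$$x^5 + \tfrac{7}{8}x^4 + \tfrac{49}{96}x^3 + \tfrac{5}{32}x^2 + \tfrac{13}{384}x + \tfrac{1}{384} = 0,$$

and that for grade 1 alone

$$x^5 + \tfrac{33}{40}x^4 + \tfrac{109}{240}x^3 + \tfrac{31}{240}x^2 + \tfrac{13}{480}x + \tfrac{1}{480} = 0.$$

The difference between these two equations gives the equation for grade 2 and above

$$\tfrac{1}{20}x^4 + \tfrac{9}{160}x^3 + \tfrac{13}{480}x^2 + \tfrac{13}{1920}x + \tfrac{1}{1920} = 0.$$

This equation has, of course, the roots $-\tfrac{1}{8}$ and $-\tfrac{1}{4} \pm \sqrt{-\tfrac{5}{48}}$ which are common to the two
characteristic equations of the fifth degree, and also a further root $-\tfrac{1}{2}$. Now there is
a biquadratic equation with the three specified common roots and not larger coefficients
(and having the coefficient of $x^4$ equal to unity), viz.

$$x^4 + \tfrac{33}{40}x^3 + \tfrac{17}{48}x^2 + \tfrac{1}{15}x + \tfrac{1}{240} = 0.$$

The fourth, irrelevant, root is $-1/5$. This equation leads to the following development: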

Grade 2 only

  t        0    1    2    3    4    5    6    7   8 and after
  Age 1    73   43   50   52   49   50   50   50   50
      2    18   60   35   41   43   40   41   41   41
      3     6    8   26   14   18   19   18   18   18
      4    11    -    1    5    2    3    3    3    3
      5     4    1    -    -    -    -    -    -    -
  Total   112  112  112  112  112  112  112  112  112

Grade 3 and above

  t        0    1    2    3    4    5    6    7   8 and after
  Age 2     7   22   13   15   15   16   15   15   15
      3     3    4   14    9    9    9    9    9    9
      4    12    2    2    5    4    4    4    4    4
      5     7    1    -    -    1    -    1    1    1
  Total    29   29   29   29   29   29   29   29   29


Analysis into further grades is impossible in this case, because the characteristic equation
of the third grade does not have any roots apart from the three common roots of all previous
equations.
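The arithmetic of this example can be checked with exact fractions. The sketch below assumes the survival probabilities of §§ 1 and 2 and the biquadratic $x^4 + \tfrac{33}{40}x^3 + \tfrac{17}{48}x^2 + \tfrac{1}{15}x + \tfrac{1}{240}$; the names `diff`, `q`, `grade2` and `value` are chosen here for illustration.

```python
from fractions import Fraction as F

p = [F(1), F(7, 8), F(49, 96), F(5, 32), F(13, 384), F(1, 384)]
p_lower = [F(1), F(33, 40), F(109, 240), F(31, 240), F(13, 480), F(1, 480)]

def value(coeffs, x):
    """Evaluate a polynomial (highest power first) at x by Horner's rule."""
    total = F(0)
    for c in coeffs:
        total = total * x + c
    return total

diff = [a - b for a, b in zip(p, p_lower)][1:]   # equation for grades 2 and above
q = [d / diff[0] for d in diff]                  # 'survival' rates, q[0] = 1
grade2 = [F(1), F(33, 40), F(17, 48), F(1, 15), F(1, 240)]

print(q)
print(value(diff, F(-1, 8)), value(diff, F(-1, 2)))
print(value(grade2, F(-1, 8)), value(grade2, F(-1, 5)))
```

The rate $q_1 = 9/8$ exceeds unity, which illustrates the remark that 'survival' in the aggregate of grades 2 and above is a balance between deaths and promotions.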

5. Promotion rates dependent on seniority

We still consider more than two grades, but now we will assume that the promotion rates do
not depend on the attained age but on the seniority, i.e. on the time spent in the grade,
instead. In the lowest grade seniority is equivalent to age, because all members were supposed
to enter at the lowest age only. If we consider again the two grades of § 3, but this time
take note of differences in seniority, we find the following pattern:


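  Age   Lower grade           Higher grade, seniority 0     seniority 1                 ...   Total
   0    $\phi_t$                                                                              $\phi_t$
   1    $\phi_{t-1}p_1t_1$    $\phi_{t-1}p_1(t_0 - t_1)$                                      $\phi_{t-1}p_1$
   2    $\phi_{t-2}p_2t_2$    $\phi_{t-2}p_2(t_1 - t_2)$    $\phi_{t-2}p_2(t_0 - t_1)$        $\phi_{t-2}p_2$
  ...

Note. $t_i = p'_i/p_i$ and hence $t_0 = 1$.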
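If we consider now promotion from grade 2 into grade 3, and if we introduce the probability
$u_s$ of not being promoted during $s$ years from grade 2 ($u_0 = 1$), we see that grades 2
and 3 (including higher grades, if any) will have the following constitution:

Grade 2

  Age 1:  seniority 0:      $\phi_{t-1}p_1(t_0 - t_1)u_0$
  Age 2:  seniority 0:      $\phi_{t-2}p_2(t_1 - t_2)u_0$
          seniority 1:      $\phi_{t-2}p_2(t_0 - t_1)u_1$
  Age x:  seniority 0:      $\phi_{t-x}p_x(t_{x-1} - t_x)u_0$
          seniority 1:      $\phi_{t-x}p_x(t_{x-2} - t_{x-1})u_1$
          ...
          seniority x - 1:  $\phi_{t-x}p_x(t_0 - t_1)u_{x-1}$

Grade 3

  Age 2:  seniority 0:      $\phi_{t-2}p_2(t_0 - t_1)(u_0 - u_1)$
  Age 3:  seniority 0:      $\phi_{t-3}p_3[(t_1 - t_2)(u_0 - u_1) + (t_0 - t_1)(u_1 - u_2)]$
          seniority 1:      $\phi_{t-3}p_3(t_0 - t_1)(u_0 - u_1)$
  Age x:  seniority 0:      $\phi_{t-x}p_x[(t_{x-2} - t_{x-1})(u_0 - u_1) + (t_{x-3} - t_{x-2})(u_1 - u_2) + \ldots + (t_0 - t_1)(u_{x-2} - u_{x-1})]$
          seniority 1:      $\phi_{t-x}p_x[(t_{x-3} - t_{x-2})(u_0 - u_1) + \ldots + (t_0 - t_1)(u_{x-3} - u_{x-2})]$
          ...
          seniority x - 2:  $\phi_{t-x}p_x(t_0 - t_1)(u_0 - u_1)$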






It follows by means of the same argument as before that grade 2 can be kept constant if
we can find the $u_s$ such that the equation

$$p_1(t_0 - t_1)x^{n-1} + p_2[(t_1 - t_2) + (t_0 - t_1)u_1]x^{n-2} + \ldots + p_n[(t_{n-1} - t_n) + (t_{n-2} - t_{n-1})u_1 + \ldots + (t_0 - t_1)u_{n-1}] = 0$$

has the same roots which were common to

$$p_1(t_0 - t_1)x^{n-1} + \ldots + p_n(t_0 - t_n) = 0,$$

which is identical with the difference of the first two equations of degree $n$ referring
respectively to the whole population and to the lowest grade. We must further insist that all $u_s$
must have non-negative values, not larger than 1. The coefficients of the powers of $x$ must
also be positive, but it is not necessary that $u_{s+1} \le u_s$ unless we do not admit reversions.
If $m$ is the number of common roots, then it follows again as in the last section that $n - m + 1$
grades could exist which remain constant under the operation of promotions, but that their
age and seniority distributions change.

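Example. Dealing once more with the same example as in the previous sections, we have
to find a biquadratic equation

$$\tfrac{7}{8}\left(1 - \tfrac{33}{35}\right)x^4 + \tfrac{49}{96}\left[\left(\tfrac{33}{35} - \tfrac{218}{245}\right) + \left(1 - \tfrac{33}{35}\right)u_1\right]x^3 + \tfrac{5}{32}\left[\left(\tfrac{218}{245} - \tfrac{62}{75}\right) + \left(\tfrac{33}{35} - \tfrac{218}{245}\right)u_1 + \left(1 - \tfrac{33}{35}\right)u_2\right]x^2$$
$$+ \tfrac{13}{384}\left[\left(\tfrac{62}{75} - \tfrac{4}{5}\right) + \left(\tfrac{218}{245} - \tfrac{62}{75}\right)u_1 + \left(\tfrac{33}{35} - \tfrac{218}{245}\right)u_2 + \left(1 - \tfrac{33}{35}\right)u_3\right]x$$
$$+ \tfrac{1}{384}\left[\left(\tfrac{4}{5} - \tfrac{4}{5}\right) + \left(\tfrac{62}{75} - \tfrac{4}{5}\right)u_1 + \left(\tfrac{218}{245} - \tfrac{62}{75}\right)u_2 + \left(\tfrac{33}{35} - \tfrac{218}{245}\right)u_3 + \left(1 - \tfrac{33}{35}\right)u_4\right] = 0,$$

or, if we divide by the coefficient of $x^4$ and use four significant figures in every fraction,

$$x^4 + (0.5417 + 0.5833u_1)x^3 + (0.1973 + 0.1658u_1 + 0.1786u_2)x^2$$
$$+ (0.01806 + 0.04274u_1 + 0.03593u_2 + 0.03869u_3)x$$
$$+ (0 + 0.001389u_1 + 0.003288u_2 + 0.002764u_3 + 0.002976u_4) = 0.$$

This biquadratic equation must have the roots $-\tfrac{1}{8}$ and $-\tfrac{1}{4} \pm \sqrt{-\tfrac{5}{48}}$. If the fourth root is
called $(-z)$, then the equation must be identical with

$$(x + z)\left(x^3 + \tfrac{5}{8}x^2 + \tfrac{11}{48}x + \tfrac{1}{48}\right) = 0.$$

Simple arithmetic shows then that

$$u_1 = 0.1429 + 1.7142z, \qquad u_2 = 0.0459 + 1.9082z,$$
$$u_3 = -0.1283 + 2.2566z \quad\text{and}\quad u_4 = 0.0019 + 1.9962z.$$

Now $z$ must be at least 0.05685 to make $u_3$ positive and it must not exceed 0.5, because
otherwise the $u_i$ would exceed unity. But then $u_4$ will always be larger than $u_3$, unless we put
$z = 1/2$, which would mean $u_i = 1$ for all $i$ and then there would be no members at all in grades 3
and above. It follows that we must admit reversions from grade 3 into grade 2. We can then,
for instance, take $z = 0.2$ and have

$$u_1 = 0.4857, \quad u_2 = 0.4276, \quad u_3 = 0.3230 \quad\text{and finally}\quad u_4 = 0.4011.$$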
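The $u_s$ can also be obtained in exact arithmetic by matching coefficients one power of $x$ at a time. This is a sketch under the assumptions that the grade 2 equation has the form given in the general argument of this section and that the three common roots belong to the cubic $x^3 + \tfrac{5}{8}x^2 + \tfrac{11}{48}x + \tfrac{1}{48}$; the function name `solve_u` is hypothetical.

```python
from fractions import Fraction as F

p = [F(1), F(7, 8), F(49, 96), F(5, 32), F(13, 384), F(1, 384)]
t = [F(1), F(33, 35), F(218, 245), F(62, 75), F(4, 5), F(4, 5)]
cubic = [F(1), F(5, 8), F(11, 48), F(1, 48)]   # the three common roots

def solve_u(z):
    """Return u_0..u_4 so that the grade 2 equation equals (x + z) * cubic."""
    target = [a + z * b for a, b in zip(cubic + [F(0)], [F(0)] + cubic)]
    lead = p[1] * (t[0] - t[1])
    u = [F(1)]
    for k in range(2, 6):
        # Coefficient k is p_k * sum_s (t[k-1-s] - t[k-s]) * u_s / lead.
        known = sum((t[k - 1 - s] - t[k - s]) * u[s] for s in range(k - 1))
        u.append((target[k - 1] * lead / p[k] - known) / (t[0] - t[1]))
    return u

for z in (F(1, 5), F(1, 2)):
    print(z, [round(float(v), 4) for v in solve_u(z)])
```

With $z = 1/2$ every $u_s$ comes out as exactly 1, which is the degenerate case mentioned in the text where grades 3 and above are empty.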

The biquadratic equation becomes

$$x^4 + \tfrac{33}{40}x^3 + \tfrac{17}{48}x^2 + \tfrac{1}{15}x + \tfrac{1}{240} = 0.$$

This is the same as the one used in § 4, and we can again write down the changing pattern of
the population, but this time taking also seniority into account:


Grade 2 


Age 




0 



1 

2 

3 

1 

73 





73 

43 

1 

1 

■ 

43 

50 




50 

62 




62 


12 

6 

— 

— 

— 

18 

40 

20 

B 

^9 

00 

23 

12 

— 

— 

36 

27 

14 

— 

— 

41 

D 

3 

2 

1 

— 

— 

6 

4 

2 

B 

^9 

8 

16 

6 

6 

— 

26 

8 

"3 ‘ 

3 

— 

14 

Wm 

3 

3 

3 

2 

2 

1 

1 

11 

4 

— 

— 


H 

0 

1 

— 

■ 

1 

— 

1 

1 

2 

1 

1 

6 

1 



fl 


■ 


87 

22 

3 

■ 


88 

19 

6 

— 

112 

88 

19 

4 

1 

112 

1 n 

1 

6 

6 

1 

7 and later 

1 

49 

_ 

* 

■ 

n 

49 

50 





m 




m 





50 

2 

28 

16 

— 


^9 

43 

27 

13 

— 

— 


27 

14 

— 

— 

41 

27 

14 

— 

— 

41 

3 

■n 

4 

4 



18 

10 

6 

4 

— 

19 

m 

4 

4 

— 

18 

■El 

4 

4 

— 

18 

4 

5 

1 

1 

— 

IS 

s 

2 

1 

1 

1 

— 

3 

1 

1 

1 

— 

3 

1 

1 

1 

— 

3 

■ 

i 

■ 


■ 

1 




i 

■ 

■ 

i 

19 

5 

IS 

1 


19 

5 

■ 

112 
Grade 3


Age 

0 

1 

2 

3 

2 

7 


_ 

_ 

_ 

7 

22 




22 

13 


' 


13 

15 




15 

3 

1 

2 

— 

— 

— 

3 

2 

2 

—. 


4 

■1 

8 

_ 

__ 

14 

4 

6 

_ 

_ 

9 

4 

4 

3 

6 

”T 


12 

1 

— 

1 

— 

2 


1 

1 

— 

2 

1 

2 

2 

— 

5 

5 

1 

2 

2 

2 


7 




1 

1 

— 

H 

H 

—' 

— 



1 





B 

B 

2 

— 

29 

25 


1 

1 


19 


■ 

— 

29 


7 

2 

B 

29 


4 

6 

6 

7 and later 

2 1 

15 

_ 

_ 

_ 


>’6 

16 




16 

15 




15 

16 




15 

3 ' 

4 

6 

— 

— 

— 

9 

4 

5 

— 


9 

4 

6 

— 

— 

9 

4 

5 

— 

— 


4 1 

1 

1 

2 

— 

— 

4 

1 

1 

2 

— 

4 

1 

1 

2 


4 

1 

1 

2 

— 


5 * 

H 

1 


i 

H 

1 

— 

—, 

— 

— 

— 


1 

— 

— 

1 


1 

— 

— 

1 

■ 


B 

2 

B 

B 

29 

21 

6 

2 

— 

29 


B 

2 

■ 



7 

2 

B 

29 


We find, as before, that further splitting up of grades is impossible, if the total in each
grade is to remain constant throughout the years.


Summary

This investigation deals with a stratified population, which is subject to (i) mortality,
dependent on age, and to (ii) promotion rates, indicating the ratios of members of a grade
which are transferred to the next higher grade at the end of the year.

Section 1 concerns a population which is not yet stratified and formulae are deduced to
calculate the number of entrants at time $t$, necessary to replace yearly deaths and thus to
keep the total of the population constant. This number depends clearly on the mortality
rates and on the age distribution existing at time $t = 0$. In general the population tends
towards a limiting age distribution, the 'intrinsic stationary population'.

Section 2 considers two populations and conditions are derived for the case that they need, 
every year, equal numbers of entrants to keep them constant. 

Section 3 introduces the stratified population. Both mortality and promotion rates 
depend on the age, and they are independent of the time t. Under certain conditions one of 
the two populations considered in § 2 can be taken as the whole and the other as the lowest 
grade in it. It is shown how and when entries into the grade can, at the same time, replace 
both losses due to mortality in the whole population, and to mortality and promotion 
depleting the lowest grade. This can also be described by saying that the totals of both grades 
can be kept constant at the same time, although the age distributions change from year 
to year. 

Section 4 generalizes the results of the previous section for a population consisting of 
h grades. If the population is spread over n ages, then it is shown that up to n -1 grades 
A SIMPLE APPROACH TO CONFOUNDING AND FRACTIONAL
REPLICATION IN FACTORIAL EXPERIMENTS

By O. KEMPTHORNE, Rothamsted Experimental Station

Introduction

The design and analysis of factorial experiments was described in 1937 by Yates in considerable
detail. In his treatment Yates described first the $2^n$ system and then went on to deal
with $3^n$ experiments and experiments of the $2^m 3^n$ type. The $2^n$ system is capable of very
easy explanation, but with experiments of higher order both the design and analysis become
of increasing complexity. It is the purpose of this paper to present a general method by
which factorial designs of the type $p^n$ may be examined, in respect of both confounding and
fractional replication. The method will be described by explanation of the rules for the $2^n$
and $3^n$ systems and corresponds quite closely to that given by Fisher (1942). The present
approach presents confounding and fractional replication as different aspects of the same
process. Experimental designs suggested by Plackett & Burman (1946) are also discussed.

The $2^n$ system

In this system all combinations of $n$ factors each at two levels are tested. The totality of
treatment combinations may be represented by the points of an $n$-dimensional lattice, each
side being of unit length. Let the factors be $x_1, x_2, \ldots, x_n$, represented by the mutually orthogonal
axes $y_1, \ldots, y_n$. The point $(000 \ldots 0)$ will then represent the control treatment, $(1000 \ldots 0)$ the
treatment consisting of $x_1$ at the upper level and all the other factors at the lower level, and
so on. The treatment effect of $x_1$ is the difference of the means of the yields of plots receiving
$x_1$ and those not receiving $x_1$. It is therefore the difference between the mean of the plots
represented by points lying on the plane $y_1 = 1$ and the mean of those represented by the
points on the plane $y_1 = 0$. The interaction of $x_1$ and $x_2$ is the difference between the means
of those plots represented by $y_1 = 1$, $y_2 = 1$ or $y_1 = 0$, $y_2 = 0$ and those represented by
$y_1 = 0$, $y_2 = 1$ and $y_1 = 1$, $y_2 = 0$, i.e. the difference of the means of those plots for which

$$y_1 + y_2 = 2 \text{ or } = 0 \pmod 2,$$

and those for which

$$y_1 + y_2 = 1 \pmod 2.$$

Similarly, the triple interaction of $x_1$, $x_2$ and $x_3$ is the difference between the means of those
plots for which

$$y_1 + y_2 + y_3 = 0 \pmod 2,$$

and those for which

$$y_1 + y_2 + y_3 = 1 \pmod 2.$$

This process can be continued to the consideration of the interaction of $x_1, x_2, \ldots, x_n$, which
is the difference between the mean of those plots for which

$$y_1 + y_2 + y_3 + \ldots + y_n = 0 \pmod 2,$$

and the mean of those for which

$$y_1 + y_2 + y_3 + \ldots + y_n = 1 \pmod 2.$$

In the $n$-dimensional space parallel hyper-planes may be drawn containing the points of the
lattice, such that the total yield forming the positive part of an interaction is obtained from
a set of parallel hyper-planes equidistant from each other. Likewise the negative part is
obtained from another set of parallel hyper-planes, each plane of which lies midway between
two planes of the first set.
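The parity rule can be illustrated by enumerating the lattice points. This is a minimal sketch; the function name `interaction_halves` and the 0-based factor indices are conventions chosen here, not the paper's notation.

```python
from itertools import product

def interaction_halves(factors, n):
    """Map each point of the 2^n lattice to the half of the contrast it falls in.

    `factors` lists the (0-based) factors in the interaction; the half is the
    sum of their levels modulo 2.
    """
    return {y: sum(y[i] for i in factors) % 2 for y in product((0, 1), repeat=n)}

halves = interaction_halves((0, 1), 3)   # interaction of x1 and x2 in a 2^3 design
for point, half in sorted(halves.items()):
    print(point, half)
```

Each half contains the same number of treatment combinations, as a contrast between two means requires.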


The $3^n$ system

With $n$ factors at each of three levels the treatment combinations are given by an
$n$-dimensional lattice, each side being of length two units and containing three points. The
treatment contrasts may be described as in the $2^n$ system with some slight modifications.

Any contrast in the $3^n$ system involves the comparison of three totals of the yields of
plots, and may be represented by the comparison of the differences between the yields of the
plots lying on three sets of parallel hyper-planes. For example, if $n = 2$ the lattice is as
follows:

[Lattice diagram: the nine points $(y_1, y_2)$ with $y_1 = 0, 1, 2$ and $y_2 = 0, 1, 2$.]

The main effect of $x_1$ is the difference between the totals of yields of plots for which $y_1 = 0$,
$y_1 = 1$ and $y_1 = 2$. The $I$ component* of the interaction of $x_1$ and $x_2$ is the difference between
the totals of the yields of plots for which

$$y_1 - y_2 = 0, \quad y_1 - y_2 = 1 \quad\text{and}\quad y_1 - y_2 = 2.$$

The $J$ component is given by the contrast between the yields of plots for which

$$y_1 + y_2 = 0, \quad y_1 + y_2 = 1 \quad\text{and}\quad y_1 + y_2 = 2.$$

Anticipating the extension to cases when $n$ is greater than 2, the equations for the $I$
component may be written as follows:

  $X_1X_2(I_0)$:  $y_1 + 2y_2 = 0 \pmod 3$,
  $X_1X_2(I_1)$:  $y_1 + 2y_2 = 1 \pmod 3$,
  $X_1X_2(I_2)$:  $y_1 + 2y_2 = 2 \pmod 3$.

If $x_1$ and $x_2$ (and therefore $y_1$ and $y_2$) are interchanged, then $X_2X_1(I_0)$ is given by the
equations $y_2 + 2y_1 = 0$, $X_2X_1(I_1)$ by $y_2 + 2y_1 = 1$, $X_2X_1(I_2)$ by $y_2 + 2y_1 = 2$, all mod 3. But
the equation $y_2 + 2y_1 = 0 \pmod 3$ is identical with the equation $y_1 + 2y_2 = 0 \pmod 3$, since
$3y_1 + 3y_2 = 0 \pmod 3$, whatever the values of $y_1$ and $y_2$; $X_2X_1(I_0)$ is therefore equal to
$X_1X_2(I_0)$. Subtracting the equation $y_2 + 2y_1 = 1 \pmod 3$ from the equation

$$3y_1 + 3y_2 = 0 = 3 \pmod 3,$$

we get $y_1 + 2y_2 = 2 \pmod 3$; $X_2X_1(I_1)$ is therefore identical with $X_1X_2(I_2)$. It is obvious
from the equations given above for the $J$ component that $X_1X_2(J_i) = X_2X_1(J_i)$ for $i = 0$,
1 and 2.

* Yates's terminology for the components of interactions is used where convenient, but it is more
convenient to refer to $J_1$, $J_2$ and $J_3$ as $J_0$, $J_1$ and $J_2$ respectively.
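The relations between $X_1X_2(I_i)$ and $X_2X_1(I_i)$ can be checked by listing the nine lattice points. The sketch below is illustrative only; the helper `classes` and the variable names are assumptions introduced here.

```python
from itertools import product

points = list(product(range(3), repeat=2))   # (y1, y2) on the 3 x 3 lattice

def classes(a1, a2):
    """Split the lattice by the value of a1*y1 + a2*y2 (mod 3)."""
    groups = {0: set(), 1: set(), 2: set()}
    for y1, y2 in points:
        groups[(a1 * y1 + a2 * y2) % 3].add((y1, y2))
    return groups

i_12 = classes(1, 2)   # X1X2(I): y1 + 2*y2
i_21 = classes(2, 1)   # X2X1(I): y2 + 2*y1
j_12 = classes(1, 1)   # J component: y1 + y2
print(i_12)
print(i_21[0] == i_12[0], i_21[1] == i_12[2], i_21[2] == i_12[1])
```

Interchanging the factors leaves the set $I_0$ unchanged and swaps $I_1$ with $I_2$, so the same two degrees of freedom are described either way.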
Considering the case $n = 3$, it is easily seen that the second order interaction may be
split into four parts each consisting of the contrasts between three totals. These may be
represented by the following equations:

  (I)    $y_1 + y_2 + y_3 = 0, 1, 2 \pmod 3$,
  (II)   $y_1 + 2y_2 + y_3 = 0, 1, 2 \pmod 3$,
  (III)  $y_1 + y_2 + 2y_3 = 0, 1, 2 \pmod 3$,
  (IV)   $y_1 + 2y_2 + 2y_3 = 0, 1, 2 \pmod 3$.

In order these have been named by Yates

  Z, X, Y, W.

It is interesting in passing to note the relations between Z, X, Y and W for permutations of
the order of the factors. It is obvious from (I) that Z is invariant for any change in order of
the three factors $X_1$, $X_2$ and $X_3$. Interchanging $y_2$ and $y_3$, equations (II) become equations
(III), so that $ABC(X) = ACB(Y)$. The following interchanges may be easily verified (using
the equation $3y_1 + 3y_2 + 3y_3 = 0 \pmod 3$ where necessary):

$$ABC(X) = BCA(W) = CAB(Y) = ACB(Y) = CBA(X) = BAC(W).$$
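Relations of this kind can be verified by enumerating the 27 points of the $3^3$ lattice and comparing the partitions into three sets. The sketch assumes that a symbol such as BCA(Y) means the Y form $y_1 + y_2 + 2y_3$ applied to the factors in the order B, C, A; under that reading the enumeration pairs the order BCA with W. All names in the code are hypothetical.

```python
from itertools import product

points = list(product(range(3), repeat=3))
COMPONENTS = {"Z": (1, 1, 1), "X": (1, 2, 1), "Y": (1, 1, 2), "W": (1, 2, 2)}

def partition(order, name):
    """Partition of the 3^3 lattice for component `name` with factors taken in `order`.

    `order` is a string such as "BCA"; the coefficients of the component are
    applied to the factors in that order. Points are always indexed as (A, B, C).
    """
    coeffs = dict(zip(order, COMPONENTS[name]))
    groups = {}
    for pt in points:
        levels = dict(zip("ABC", pt))
        key = sum(coeffs[f] * levels[f] for f in "ABC") % 3
        groups.setdefault(key, set()).add(pt)
    # The labelling 0, 1, 2 of the three sets does not matter for the contrast.
    return frozenset(frozenset(g) for g in groups.values())

reference = partition("ABC", "X")
pairs = [("BCA", "W"), ("CAB", "Y"), ("ACB", "Y"), ("CBA", "X"), ("BAC", "W")]
for order, name in pairs:
    print(order, name, partition(order, name) == reference)
```

Comparing partitions rather than individual equations makes the check independent of which of the three totals is labelled 0, 1 or 2.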

From the equations, it is clear that Z, X, Y and W may be computed in the way given by
Yates, since

$$Z = J\{x_1, J(x_2, x_3)\}, \qquad Y = J\{x_1, I(x_2, x_3)\},$$
$$X = I\{x_1, I(x_2, x_3)\}, \qquad W = I\{x_1, J(x_2, x_3)\},$$

$I(x_2, x_3)$ and $J(x_2, x_3)$ being evaluated for each level of $x_1$. The extension to the case $n = 4$ is
again obvious; the main effects, two-factor and three-factor interactions, follow as in the
above, and the four-factor interaction may be split into eight comparisons of three totals:

  I      $y_1 + y_2 + y_3 + y_4 = 0, 1, 2 \pmod 3$,
  II     $y_1 + y_2 + y_3 + 2y_4 = 0, 1, 2 \pmod 3$,
  III    $y_1 + y_2 + 2y_3 + y_4 = 0, 1, 2 \pmod 3$,
  IV     $y_1 + y_2 + 2y_3 + 2y_4 = 0, 1, 2 \pmod 3$,
  V      $y_1 + 2y_2 + y_3 + y_4 = 0, 1, 2 \pmod 3$,
  VI     $y_1 + 2y_2 + y_3 + 2y_4 = 0, 1, 2 \pmod 3$,
  VII    $y_1 + 2y_2 + 2y_3 + y_4 = 0, 1, 2 \pmod 3$,
  VIII   $y_1 + 2y_2 + 2y_3 + 2y_4 = 0, 1, 2 \pmod 3$.

As in the case of two factors, the effect of permutations of the order on the components of
X, Y, Z and W may be easily obtained. The four-factor interactions may be computed by
putting the equations given above into the following form:

  I = $J\{x_1, Z\}$,      V = $I\{x_1, W\}$,
  II = $J\{x_1, Y\}$,     VI = $I\{x_1, X\}$,
  III = $J\{x_1, X\}$,    VII = $I\{x_1, Y\}$,
  IV = $J\{x_1, W\}$,     VIII = $I\{x_1, Z\}$,

where the three components of W, X, Y, Z of $x_2$, $x_3$, $x_4$ (in that order) are evaluated for
each level of $x_1$.


The $p^n$ system

The total of $p^n - 1$ degrees of freedom, where $p$ is a prime, in the analysis of variance of
a $p^n$ experiment may be split into $(p^n - 1)/(p - 1)$ sets of $(p - 1)$ degrees of freedom, the
contrasts being given by the following hyper-planes:

  Main effects (mod $p$):
    $y_1 = 0, 1, 2, \ldots, p - 1$,
    $y_2 = 0, 1, 2, \ldots, p - 1$,
    ...

  Interactions of pairs of factors, e.g. of $x_1$ and $x_2$ (mod $p$):
    $y_1 + y_2 = 0, 1, 2, \ldots, p - 1$,
    $y_1 + 2y_2 = 0, 1, 2, \ldots, p - 1$,
    ...
    $y_1 + (p - 1)y_2 = 0, 1, 2, \ldots, p - 1$,

and so on to the interaction between all the factors which is given by the hyper-planes

$$a_1y_1 + a_2y_2 + a_3y_3 + \ldots + a_ny_n = 0, 1, 2, \ldots, p - 1 \pmod p,$$

where $a_1$ equals 1 and $a_2, \ldots, a_n$ each may take all values from 1 to $p - 1$.


SIMPLIFICATION OF NOTATION

The p^n - 1 degrees of freedom in the p^n system may be split into (p^n - 1)/(p - 1) sets of
(p - 1) degrees of freedom, given by the above hyper-planes, but it is only necessary to
specify one hyper-plane of each set of the parallel hyper-planes.

All the comparisons may be denoted by y_1^{α_1} y_2^{α_2} ... y_n^{α_n}, the symbol meaning that the
comparisons are given by the hyper-planes

    α_1 y_1 + α_2 y_2 + ... + α_n y_n = 0, 1, 2, ..., p - 1 (mod p).

In order to obtain an enumeration which covers all the possibilities once and once only, it is
necessary to use the rule that the factors are always written down in ascending order,
i.e. y_i^{α_i} y_j^{α_j} y_k^{α_k}, etc., such that i < j < k ..., and that α_i = 1.

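The enumeration rule just stated can be checked by a short computation. The following is only a sketch: the function names `effects` and `symbol` are ours, not part of the paper, and an effect is represented by its vector of exponents (α_1, ..., α_n) with the first non-zero exponent equal to 1.

```python
from itertools import product

def effects(p, n):
    """Exponent vectors (a_1, ..., a_n), not all zero, whose first
    non-zero exponent equals 1: one vector per set of p - 1 d.f."""
    return [a for a in product(range(p), repeat=n)
            if any(a) and next(x for x in a if x) == 1]

def symbol(a):
    """Render an exponent vector in the y_i^a notation of the text."""
    return " ".join(f"y{i}" if e == 1 else f"y{i}^{e}"
                    for i, e in enumerate(a, start=1) if e)

if __name__ == "__main__":
    p, n = 3, 3
    found = effects(p, n)
    # The count should agree with (p^n - 1)/(p - 1).
    assert len(found) == (p**n - 1) // (p - 1)
    for a in found:
        print(symbol(a))
```

For p = 3 and n = 3 the list contains three main effects, six two-factor symbols and four three-factor symbols.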
The 3^3 SYSTEM in the revised notation

As an example, the 3^3 system will be examined in detail. The effects are represented by
y_1, y_2, y_3; interactions between pairs, y_1 y_2, y_1 y_2^2, y_1 y_3, y_1 y_3^2, y_2 y_3, y_2 y_3^2; interactions between
all three factors, y_1 y_2 y_3, y_1 y_2 y_3^2, y_1 y_2^2 y_3, y_1 y_2^2 y_3^2. Any other combination of powers of the y's
can be reduced to the above set.

It is interesting to examine the interactions of the effects and interactions. In the case of
the 2^n system, Yates refers to the generalized interaction of two interactions ABCD and
CDE say, which is ABE. The interaction of effects or interactions A and B consists of AB
and AB^2 in the 3^n system.

(a) The interactions of main effects are obviously interactions between pairs of factors.

(b) The interactions of main effects and two-factor interactions with one letter in common
are two-factor interactions and main effects: e.g. the interactions of y_1 and y_1 y_2 are

    y_1^2 y_2 = y_1 y_2^2, and y_1^3 y_2^2 = y_2,

and the interactions of y_1 and y_1 y_2^2 are y_1^2 y_2^2 = y_1 y_2, and y_1^3 y_2^4 = y_2.

(c) The interactions between main effects and three-factor interactions are two-factor and
three-factor interactions:

    Between                      Interactions
    y_1 and y_1 y_2 y_3          y_1^2 y_2 y_3 = y_1 y_2^2 y_3^2,      y_1^3 y_2^2 y_3^2 = y_2 y_3
    y_1 and y_1 y_2 y_3^2        y_1^2 y_2 y_3^2 = y_1 y_2^2 y_3,      y_1^3 y_2^2 y_3^4 = y_2 y_3^2
    y_1 and y_1 y_2^2 y_3        y_1^2 y_2^2 y_3 = y_1 y_2 y_3^2,      y_1^3 y_2^4 y_3^2 = y_2 y_3^2
    y_1 and y_1 y_2^2 y_3^2      y_1^2 y_2^2 y_3^2 = y_1 y_2 y_3,      y_1^3 y_2^4 y_3^4 = y_2 y_3

(d) The interactions between two-factor interactions are exemplified in the following table:

    Between                      Interactions
    y_1 y_2 and y_1 y_2^2        y_1^2 y_2^3 = y_1,        y_1^3 y_2^5 = y_2
    y_1 y_2 and y_2 y_3          y_1 y_2^2 y_3,            y_1 y_2^3 y_3^2 = y_1 y_3^2
    y_1 y_2 and y_2 y_3^2        y_1 y_2^2 y_3^2,          y_1 y_2^3 y_3^4 = y_1 y_3


(e) The interaction between two-factor and three-factor interactions are exemplified in 
the following table: 
The interactions between two-factor and three-factor interactions are therefore two-factor
interactions in some cases and main effects and three-factor interactions in the other cases.

(f) The interactions between three-factor interactions are set out in the following diagram:

    Between              y_1 y_2 y_3    y_1 y_2 y_3^2    y_1 y_2^2 y_3      y_1 y_2^2 y_3^2
    y_1 y_2 y_3          -              y_1 y_2, y_3     y_1 y_3, y_2       y_1, y_2 y_3
    y_1 y_2 y_3^2        -              -                y_1, y_2 y_3^2     y_1 y_3^2, y_2
    y_1 y_2^2 y_3        -              -                -                  y_1 y_2^2, y_3
    y_1 y_2^2 y_3^2      -              -                -                  -
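The generalized interactions tabulated in this section all follow from adding exponent vectors modulo 3 and rescaling so that the first non-zero exponent is 1. A minimal sketch of that rule is given here; the function names are our own, and the representation of a symbol by a tuple of exponents is an assumption made for illustration.

```python
def normalize(v, p):
    """Scale v (mod p) so that its first non-zero exponent is 1."""
    lead = next((x % p for x in v if x % p), None)
    if lead is None:
        return tuple(0 for _ in v)
    inv = pow(lead, -1, p)          # modular inverse, Python 3.8+
    return tuple((inv * x) % p for x in v)

def generalized_interactions(a, b, p):
    """Normalized symbols of A B^k for k = 1, ..., p - 1."""
    return [normalize([(x + k * y) % p for x, y in zip(a, b)], p)
            for k in range(1, p)]

if __name__ == "__main__":
    # y1 y2 y3 with y1 y2 y3^2 in the 3^3 system
    print(generalized_interactions((1, 1, 1), (1, 1, 2), 3))
```

With the exponent vectors (1, 1, 1) and (1, 1, 2) the function returns the vectors for y_1 y_2 and y_3, in agreement with the first entry of the diagram.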


Confounding

Confounding, or the allocation of treatment combinations to blocks, implies the allocation of
all the points of the lattice into sets of points, such that the comparisons between
these sets involve particular sets of p - 1 degrees of freedom. The aim of confounding is to
reduce the effect of soil heterogeneity by reducing block size, while ensuring that the block
comparisons have little possible practical importance.

If comparisons A = y_1^{α_1} y_2^{α_2} ... y_n^{α_n} and B = y_1^{β_1} y_2^{β_2} ... y_n^{β_n} are confounded, then so is their
generalized interaction, i.e. all the products of these two, i.e. AB, AB^2, ..., AB^{p-1}. For, if
the treatment combinations for which α_1 y_1 + α_2 y_2 + ... + α_n y_n is equal to 0, 1, 2, ..., p - 1
are put into separate blocks and also those treatments for which β_1 y_1 + β_2 y_2 + ... + β_n y_n
is equal to 0, 1, 2, ..., p - 1, then (α_1 + λβ_1) y_1 + (α_2 + λβ_2) y_2 + ... + (α_n + λβ_n) y_n is equal (mod p)
to 0, 1, 2, ..., p - 1 for all λ from 0 to p - 1.

The present approach to confounding of the 2^n system is identical with that given by
Yates and we proceed to consider the rather more complex case of the 3^n system.

(a) 3^3 system

(1) In blocks of 3^2. Any three-factor interaction may be confounded.

(2) In blocks of 3. We cannot confine the confounded degrees of freedom to three-factor
interactions because the generalized interaction of any two reduces to a two-factor
interaction and a main effect. If two three-factor interactions, y_1 y_2 y_3 and y_1 y_2^2 y_3, are confounded,
the 8 degrees of freedom for blocks may be described as follows:

                         D.F.
    y_2                   2
    y_1 y_3               2
    y_1 y_2 y_3           2
    y_1 y_2^2 y_3         2
                          8

We can, however, choose three two-factor interactions and one three-factor interaction pair
for our block comparisons.

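The statement about the 3^3 system in blocks of 3 can be verified by constructing the blocks directly. The sketch below is illustrative only (the function names are ours): it groups the 27 treatment combinations by the values of y_1 + y_2 + y_3 and y_1 + 2y_2 + y_3 (mod 3) and then tests which other linear forms are constant inside every block.

```python
from itertools import product

def value(form, y, p):
    """Value (mod p) of the linear form with coefficients `form` at point y."""
    return sum(a * v for a, v in zip(form, y)) % p

def blocks(confounded, p, n):
    """Group the p**n treatment combinations by the values of the
    confounded linear forms; each group is one block."""
    out = {}
    for y in product(range(p), repeat=n):
        key = tuple(value(form, y, p) for form in confounded)
        out.setdefault(key, []).append(y)
    return out

def constant_within_blocks(form, blks, p):
    """True if the form takes a single value in every block, i.e. if the
    corresponding pair of degrees of freedom is confounded."""
    return all(len({value(form, y, p) for y in blk}) == 1
               for blk in blks.values())

if __name__ == "__main__":
    blks = blocks([(1, 1, 1), (1, 2, 1)], 3, 3)      # nine blocks of three
    print(constant_within_blocks((0, 1, 0), blks, 3))  # y2
    print(constant_within_blocks((1, 0, 1), blks, 3))  # y1 y3
    print(constant_within_blocks((1, 0, 0), blks, 3))  # y1
```

Under this construction y_2 and y_1 y_3 are constant within blocks and y_1 is not, as the table of degrees of freedom requires.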
(b) 3^4 system in blocks of 3^2

It is immediately obvious that we can confound two two-factor interactions and two
higher-order interaction pairs to give blocks of nine. The important point, however, is to find
a design confounding only three-factor interaction pairs.

We therefore evaluate the interactions of all pairs of three-factor interactions which have
two letters in common. These may be derived from the interaction of y_1 y_2 y_3 with the four
three-factor interactions of y_1, y_2 and y_4, which are as follows:

    Interaction of y_1 y_2 y_3 and y_1 y_2 y_4:          y_1 y_2 y_3^2 y_4^2 and y_3 y_4^2.
    Interaction of y_1 y_2 y_3 and y_1 y_2 y_4^2:        y_1 y_2 y_3^2 y_4 and y_3 y_4.
    Interaction of y_1 y_2 y_3 and y_1 y_2^2 y_4:        y_1 y_3^2 y_4^2 and y_2 y_3^2 y_4.
    Interaction of y_1 y_2 y_3 and y_1 y_2^2 y_4^2:      y_1 y_3^2 y_4 and y_2 y_3^2 y_4^2.

Obviously there are many designs for the 3^4 design in nine blocks of nine plots confounding
three-factor interactions. Those which confound four-factor interactions must also confound
two-factor interactions. The names of the confounded interactions and their squares (each
of which corresponds to the same grouping as the element itself) form a group with the
identity and the equation y_i^3 = 1 for all i, and further work is presumably most promising
on these lines.
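The designs for the 3^4 system in nine blocks of nine which confound only three-factor interactions can be found by exhaustive search. The following sketch is ours and rests on one assumption about representation: each effect is an exponent vector with first non-zero exponent 1, and a design is the set of four effects generated by two confounded three-factor interactions.

```python
from itertools import combinations, product

P, N = 3, 4

def normalize(v):
    """Scale v (mod 3) so that its first non-zero exponent is 1."""
    lead = next(x for x in v if x)
    inv = pow(lead, -1, P)
    return tuple((inv * x) % P for x in v)

def n_letters(v):
    """Number of factors appearing in the symbol."""
    return sum(1 for x in v if x)

def group_of(a, b):
    """The four effects confounded when A and B are confounded:
    A, B, AB and AB^2."""
    ab = normalize([(x + y) % P for x, y in zip(a, b)])
    ab2 = normalize([(x + 2 * y) % P for x, y in zip(a, b)])
    return frozenset({a, b, ab, ab2})

three_factor = [v for v in product(range(P), repeat=N)
                if n_letters(v) == 3 and normalize(v) == v]

# Keep only the groups in which every member involves exactly three letters.
designs = {group_of(a, b) for a, b in combinations(three_factor, 2)
           if all(n_letters(v) == 3 for v in group_of(a, b))}

if __name__ == "__main__":
    print(len(three_factor), "three-factor symbols;", len(designs), "designs")
```

One of the designs found in this way is generated by y_1 y_2 y_3 and y_1 y_2^2 y_4, whose interactions y_1 y_3^2 y_4^2 and y_2 y_3^2 y_4 appear in the third line of the table.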


(c) 3^5 in blocks of 9

There is no design confounding only three-factor or higher-order interactions. If one two- 
factor interaction can be sacrificed, a possible scheme of confounding is given by the 
following table of generalized interactions: 


Between 

2 / 12 / 22/3 

ViVlyl 

y-iTAVi 

y^yiyl 

''and'''\^ 





^32/42/5 1 

2 / 1 2/a 2/3 2/4 2/6 

2 / 1 2/a 2/4 2/? 

2 / 12/2 2/a 2/6 

2/l2/2y32/42/6 

ViVlyn 

Viysyl 

y^Vi 

ViV^yiVl 


This two-factor interaction is estimated by the comparison of three sets of nine blocks, 
and the accuracy of the estimate will be low. 

(d) 3^6 in blocks of 27

We may, for example, confound the following: 

2 / 22 / 42/6 

2/12/22/3 2/12/12/82/42/6 2/ i 2/82/42/6 

2/12/42/6 2/12/22/I2/62/6 ViVlVsyl 

yivlytylyl Viylylva yiy^yly^ylvl 

2/22/32/I2/? 2/22/12/62/1 2/32/42/12/1 

Three three-factor interactions, six four-factor interactions, three five-factor interactions
and one six-factor interaction are confounded. If y_6 is omitted from all the above expressions
we obtain a 3^5 experiment in blocks of nine confounding one two-factor interaction, seven
three-factor interactions, three four-factor and two five-factor interactions; that is, the
design given above for the 3^5 system.

Extension to more complicated cases

Extensions of the above to more complicated cases should most easily be achieved by the use
of group theory. The confounding of a design in blocks corresponds to a group of
elements such that all except the unit element involve at least a certain number of
letters. For most agricultural experiments each element should contain at least three letters,
so that no main effects or two-factor interactions are confounded. The group is an Abelian
group and if A and B are elements of the group so are AB, AB^2, .... The order of each
element is p, and if A is an element so are the first (p - 1) powers of A. This aspect is being
followed, and it is hoped will yield results.

Fractional replication in the 2^n system

Some principles of fractional replication have been worked out over the past few years at
Rothamsted (Finney, 1945). In the case of a 2^n system, with factors ... say, a half-replicate
might consist of those treatment combinations which form the positive part of the
interaction. Each function of the plot yields consisting of the sum of one-half
of them minus the sum of the other half then corresponds to two degrees of freedom.
Alternatively, each degree of freedom has one alias, and the aim in fractional replication is
to design the experiment so that the aliases of effects which the experimenter wishes to
measure are high-order interactions which could not possibly have practical significance.

For convenience of presentation, we develop first the theory for the case of the 2^n system.
Suppose that of all the points on the lattice for the 2^n system, only those points for which

    y_1 + y_2 + ... + y_n = 0 (mod 2)

are included in the experiment. Then the points on the hyper-plane y_1 = 0 also lie on the
plane y_2 + y_3 + ... + y_n = 0; likewise those for which y_1 = 1 lie on the plane

    y_2 + y_3 + ... + y_n = 1.

The contrast which we have denoted by y_1 is therefore identical with that denoted by
y_2 y_3 ... y_n. Again, if we suppose that only those treatment combinations are tested which
lie on the hyper-planes

    α_1 y_1 + α_2 y_2 + ... + α_n y_n = 0 (mod 2),
    β_1 y_1 + β_2 y_2 + ... + β_n y_n = 0 (mod 2),

then the points will also lie on the intersection of these planes which is given by the equation

    (α_1 + β_1) y_1 + (α_2 + β_2) y_2 + ... + (α_n + β_n) y_n = 0 (mod 2).

The points which lie on the hyper-planes

    γ_1 y_1 + γ_2 y_2 + ... + γ_n y_n = 0, 1 (mod 2)

will also lie on the planes

    (α_1 + γ_1) y_1 + (α_2 + γ_2) y_2 + ... + (α_n + γ_n) y_n = 0, 1 (mod 2),
    (β_1 + γ_1) y_1 + (β_2 + γ_2) y_2 + ... + (β_n + γ_n) y_n = 0, 1 (mod 2),
    (α_1 + β_1 + γ_1) y_1 + (α_2 + β_2 + γ_2) y_2 + ... + (α_n + β_n + γ_n) y_n = 0, 1 (mod 2).

Changing to the simpler notation, these results may be obtained by equating to unity the
symbols corresponding to the effects which the experiment cannot measure (as only treatment
combinations of the same sign in the function giving the effect are included) and
multiplying the symbol corresponding to a particular effect by these symbols. Thus we put

    I = y_1^{α_1} y_2^{α_2} ... y_n^{α_n} = y_1^{β_1} y_2^{β_2} ... y_n^{β_n} = y_1^{α_1+β_1} y_2^{α_2+β_2} ... y_n^{α_n+β_n},

then the contrast y_1^{γ_1} y_2^{γ_2} ... y_n^{γ_n} is the same as those given by

    y_1^{α_1+γ_1} y_2^{α_2+γ_2} ... y_n^{α_n+γ_n},
    y_1^{β_1+γ_1} y_2^{β_2+γ_2} ... y_n^{β_n+γ_n}
    and y_1^{α_1+β_1+γ_1} y_2^{α_2+β_2+γ_2} ... y_n^{α_n+β_n+γ_n},

where each power is reduced modulo 2.
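Since every power is reduced modulo 2, multiplying two symbols of the 2^n system amounts to taking the symmetric difference of their sets of letters. The sketch below applies this to the alias rule; the function names are ours, and factors are represented simply by their subscripts.

```python
def times(a, b):
    """Product of two symbols in the 2^n system: letters that occur
    twice cancel, so the product is the symmetric difference."""
    return frozenset(a) ^ frozenset(b)

def defining_group(generators):
    """All products of the generators, including the identity (empty set)."""
    group = {frozenset()}
    for g in generators:
        group |= {times(w, g) for w in group}
    return group

def aliases(effect, generators):
    """The other contrasts with which `effect` is identified."""
    effect = frozenset(effect)
    return {times(effect, w) for w in defining_group(generators)} - {effect}

if __name__ == "__main__":
    # Quarter-replicate with I = y1y2y3y4 = y3y4y5y6 (= y1y2y5y6)
    gens = [{1, 2, 3, 4}, {3, 4, 5, 6}]
    for alias in sorted(aliases({1}, gens), key=sorted):
        print(sorted(alias))
```

For the two generators shown, the main effect y_1 has the three aliases y_2 y_3 y_4, y_2 y_5 y_6 and y_1 y_3 y_4 y_5 y_6.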

2^n SYSTEM WITHOUT SUBDIVISION INTO BLOCKS

We now consider some of the possibilities of partial replication for the 2^n system. The basis
of designs with fractional replication is the choice of an identity relationship; most of the
possible relationships are of no value, and we consider only those which yield the least
possible confusion between main effects and first-order interactions.

Half-replication

n = 3. If we take I = y_1 y_2 y_3, then y_1 = y_1 (y_1 y_2 y_3) = y_1^2 y_2 y_3 = y_2 y_3. Such a design, which
confuses main effects and two-factor interactions, would not be of any practical use.

n = 4. If we take I = y_1 y_2 y_3 y_4, then the aliases are exemplified by

    y_1 = y_2 y_3 y_4 and y_1 y_2 = y_3 y_4.

Such a design would not be used unless the experimenter were confident that two-factor
interactions were negligible.

n = 5. If we take I = y_1 y_2 y_3 y_4 y_5, then the aliases are exemplified by

    y_1 = y_2 y_3 y_4 y_5 and y_1 y_2 = y_3 y_4 y_5.

A half-replicate with five or more factors is feasible when there is no necessity to remove
heterogeneity by the use of blocks, since main effects will have aliases which are interactions
of four factors at least, and two-factor interactions will have aliases which are interactions of
at least three factors.

Quarter-replication

Each degree of freedom will now have three aliases. For each value of n we give the identity
relationship and typical alias relationships.

n = 4. I = y_1 y_2 = y_3 y_4 = y_1 y_2 y_3 y_4,

    then y_1 = y_2 = y_1 y_3 y_4 = y_2 y_3 y_4 and y_1 y_3 = y_2 y_3 = y_1 y_4 = y_2 y_4.

n = 5. I = y_1 y_2 = y_3 y_4 y_5 = y_1 y_2 y_3 y_4 y_5 (a),

    or I = y_1 y_2 y_3 = y_3 y_4 y_5 = y_1 y_2 y_4 y_5 (b).

    (a) gives y_1 = y_2 = y_1 y_3 y_4 y_5 = y_2 y_3 y_4 y_5 and y_1 y_3 = y_2 y_3 = y_1 y_4 y_5 = y_2 y_4 y_5.

    (b) gives y_1 = y_2 y_3 = y_1 y_3 y_4 y_5 = y_2 y_4 y_5.

n = 6. I = y_1 y_2 y_3 y_4 = y_3 y_4 y_5 y_6 = y_1 y_2 y_5 y_6;

    then y_1 = y_2 y_3 y_4 = y_1 y_3 y_4 y_5 y_6 = y_2 y_5 y_6

    and y_1 y_2 = y_3 y_4 = y_1 y_2 y_3 y_4 y_5 y_6 = y_5 y_6.

n = 7. I = y_1 y_2 y_3 y_4 = y_4 y_5 y_6 y_7 = y_1 y_2 y_3 y_5 y_6 y_7;

    then y_1 = y_2 y_3 y_4 and y_1 y_2 = y_3 y_4.

n = 8. I = y_1 y_2 y_3 y_4 y_5 = y_4 y_5 y_6 y_7 y_8 = y_1 y_2 y_3 y_6 y_7 y_8;

    then y_1 = y_2 y_3 y_4 y_5 and y_1 y_2 = y_3 y_4 y_5.

Designs in quarter replicate are therefore possible when n is greater than or equal to 8.

Biometrika 34 

HIGH-ORDER FRACTIONAL REPLICATION

In general, the existence of fractional designs of the 2^n system with fraction 2^{-p}, which will
be useful where information on all main effects and two-factor interactions is required,
depends on the existence of a group of 2^p elements, one element being unity and the other
elements all containing at least five letters. No simple method has been found of enumerating
such groups, but it is perhaps worth recording the following designs which appear to represent
the greatest degree of fractional replication possible.

(a) Eighth replication

If we are testing ten or more factors at each of two levels, one-eighth of a replication will
enable main effects and two-factor interactions to be estimated. An appropriate identity
relationship is the following:

1 = yMaViVs = = y^y^.y^y^y^^y^ 

^ 2/i2/32/72/fl2/io y^y^yhViy^yiQ = y%y^y^y^y^y\c ^yiy^Vf^iky^y^yx^^ 

Thus ten main effects and forty-five two-factor interactions may be estimated from a trial 
testing 128 of the 1024 possible treatment combinations. 

(b) Sixteenth replication

If we are testing twelve or more factors a possible identity relationship is the following:

    I = y_1 y_2 y_3 y_4 y_5 = y_1 y_2 y_6 y_7 y_8 = y_3 y_4 y_5 y_6 y_7 y_8 = y_1 y_2 y_9 y_10 y_11
      = y_3 y_4 y_5 y_9 y_10 y_11 = y_6 y_7 y_8 y_9 y_10 y_11 = y_1 y_2 y_3 y_4 y_5 y_6 y_7 y_8 y_9 y_10 y_11
      = y_1 y_3 y_6 y_9 y_12 = y_2 y_4 y_5 y_6 y_9 y_12 = y_2 y_3 y_7 y_8 y_9 y_12 = y_1 y_4 y_5 y_7 y_8 y_9 y_12
      = y_2 y_3 y_6 y_10 y_11 y_12 = y_1 y_4 y_5 y_6 y_10 y_11 y_12 = y_1 y_3 y_7 y_8 y_10 y_11 y_12
      = y_2 y_4 y_5 y_7 y_8 y_10 y_11 y_12.

In this case twelve main effects and sixty-six two-factor interactions may be estimated from
a trial testing 256 of the possible 4096 treatment combinations.
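The requirement that every element of the identity group other than unity contain at least five letters can be checked mechanically. In the sketch below we assume that the sixteenth-replicate relationship is generated by the four words y_1 y_2 y_3 y_4 y_5, y_1 y_2 y_6 y_7 y_8, y_1 y_2 y_9 y_10 y_11 and y_1 y_3 y_6 y_9 y_12; this choice of generators is our reading of the printed relationship, not something stated in the text.

```python
from itertools import combinations

# Assumed generators of the identity relationship (subscripts of the y's).
GENERATORS = [{1, 2, 3, 4, 5}, {1, 2, 6, 7, 8}, {1, 2, 9, 10, 11}, {1, 3, 6, 9, 12}]

def defining_group(generators):
    """All products of the generators; a product is a symmetric difference."""
    group = {frozenset()}
    for g in generators:
        group |= {w ^ frozenset(g) for w in group}
    return group

group = defining_group(GENERATORS)
shortest = min(len(w) for w in group if w)

# Main effects and two-factor interactions for twelve factors.
low_order = [frozenset(c) for k in (1, 2) for c in combinations(range(1, 13), k)]
# Each alias set is the effect multiplied by every element of the group.
alias_sets = {frozenset(e ^ w for w in group) for e in low_order}

if __name__ == "__main__":
    print(len(group), "elements; shortest word has", shortest, "letters")
    print(len(low_order), "low-order effects fall in", len(alias_sets), "alias sets")
```

Under this assumption the group has 16 elements, its shortest word has five letters, and the 12 main effects and 66 two-factor interactions fall in 78 distinct alias sets, so that no two of them are confused with one another.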

The extent to which these designs will be of practical value depends very much on the 
existence of a sufficient mass of reasonably homogeneous material to test the large number 
of treatment combinations without the necessity of dividing the material into smaller 
batches and using the device of confounding. An experiment involving say 256 different 
treatment combinations is not large by modern standards. At Rothamsted, for example, an 
experiment involving 200 distinct treatments on 300 plots has been carried out for some 
years: this experiment was, however, made possible by utilizing the elimination of the 
effects of soil heterogeneity by highly complex confounding; the design, in fact, consisted of 
three 5x5 lattice squares necessitating seventy-five plots, and each of these plots was split 
into four subplots. The advantages of testing twelve factors, say, at the same time under 
virtually the same experimental conditions cannot, however, be ignored. Such an experiment
should have more value, other things being equal, than two distinct experiments each
testing some of the factors. An examination has not been made of the possibilities of reducing 
block size by confounding for the above two designs, but it is probably necessary to sacrifice 
a few two-factor interactions. 

The relationship between fractional replication and confounding

It is clear that fractional replication and confounding are different aspects of the same
process. A 2^n design of 2^p blocks may be described as a 1 in 2^p replicate of a 2^{n+p} design
with no subdivision into blocks, by regarding the blocks as a 2^p system in p factors. As an
example, consider the 2^5 design in y_1, y_2, y_3, y_4 and y_5 laid out in four blocks of eight and
confounding y_1 y_2 y_3, y_3 y_4 y_5 and y_1 y_2 y_4 y_5; superimposing two pseudo-factors b_1 and b_2, the
experiment is a quarter-replicate of a 2^7 design in y_1, y_2, y_3, y_4, y_5, b_1, b_2. The identity on which
the quarter replicate is based is given by the equations

    b_1 = y_1 y_2 y_3,    b_2 = y_3 y_4 y_5,    b_1 b_2 = y_1 y_2 y_4 y_5,

or the equation I = y_1 y_2 y_3 b_1 = y_3 y_4 y_5 b_2 = y_1 y_2 y_4 y_5 b_1 b_2.

If we examine this equation in the same way as in the previous sections, we find that the
design depends on the fact that the aliases of the following type may be ignored:

    y_1 = y_2 y_3 b_1 = y_1 y_3 y_4 y_5 b_2 = y_2 y_4 y_5 b_1 b_2,
    y_1 y_2 = y_3 b_1 = y_1 y_2 y_3 y_4 y_5 b_2 = y_4 y_5 b_1 b_2.
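The identity of the two descriptions can be confirmed by listing the treatment combinations. The following sketch is ours: the five factors are called a, b, c, d, e and the two block pseudo-factors x and y, the block levels being taken as the parities of abc and cde (an assumed convention for which combinations go into which block).

```python
from collections import Counter
from itertools import product

def parity(run, letters):
    """Sum (mod 2) of the levels of the named factors in one combination."""
    return sum(run[l] for l in letters) % 2

# The 2^5 design in four blocks: block labels x = abc and y = cde.
blocked = []
for levels in product((0, 1), repeat=5):
    run = dict(zip("abcde", levels))
    run["x"] = parity(run, "abc")
    run["y"] = parity(run, "cde")
    blocked.append(run)

# The quarter-replicate of the 2^7 design defined by I = ABCX = CDEY.
quarter = [run
           for run in (dict(zip("abcdexy", lv)) for lv in product((0, 1), repeat=7))
           if parity(run, "abcx") == 0 and parity(run, "cdey") == 0]

def as_tuple(run):
    """Levels in the fixed order a, b, c, d, e, x, y."""
    return tuple(run[k] for k in "abcdexy")

same_runs = sorted(map(as_tuple, blocked)) == sorted(map(as_tuple, quarter))
block_sizes = Counter((run["x"], run["y"]) for run in blocked)

if __name__ == "__main__":
    print("same 32 combinations:", same_runs)
    print("block sizes:", sorted(block_sizes.values()))
```

The two lists coincide, and each of the four blocks contains eight plots.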

This example is worth pursuing. The design is frequently used with one replication only,
the error being estimated from three-factor and higher-order interactions. We set out below
the identity and 31 degrees of freedom together with all their aliases and their usual place
in the analysis of variance: blocks (B), treatment (T), or error (E). For convenience of
printing we denote the factors tested in the experiment by a, b, c, d, e instead of y_1, y_2, y_3, y_4, y_5
and the block factors by x and y. Capitals are used for treatment effects, thus conforming
to present usage.


    I      = ABCX    = CDEY    = ABDEXY
    A      = BCX     = ACDEY   = BDEXY      T
    B      = ACX     = BCDEY   = ADEXY      T
    AB     = CX      = ABCDEY  = DEXY       T
    C      = ABX     = DEY     = ABCDEXY    T
    AC     = BX      = ADEY    = BCDEXY     T
    BC     = AX      = BDEY    = ACDEXY     T
    ABC    = X       = ABDEY   = CDEXY      B
    D      = ABCDX   = CEY     = ABEXY      T
    AD     = BCDX    = ACEY    = BEXY       T
    BD     = ACDX    = BCEY    = AEXY       T
    ABD    = CDX     = ABCEY   = EXY        E
    CD     = ABDX    = EY      = ABCEXY     T
    ACD    = BDX     = AEY     = BCEXY      E
    BCD    = ADX     = BEY     = ACEXY      E
    ABCD   = DX      = ABEY    = CEXY       E
    E      = ABCEX   = CDY     = ABDXY      T
    AE     = BCEX    = ACDY    = BDXY       T
    BE     = ACEX    = BCDY    = ADXY       T
    ABE    = CEX     = ABCDY   = DXY        E
    CE     = ABEX    = DY      = ABCDXY     T
    ACE    = BEX     = ADY     = BCDXY      E
    BCE    = AEX     = BDY     = ACDXY      E
    ABCE   = EX      = ABDY    = CDXY       E
    DE     = ABCDEX  = CY      = ABXY       T
    ADE    = BCDEX   = ACY     = BXY        E
    BDE    = ACDEX   = BCY     = AXY        E
    ABDE   = CDEX    = ABCY    = XY         B
    CDE    = ABDEX   = Y       = ABCXY      B
    ACDE   = BDEX    = AY      = BCXY       E
    BCDE   = ADEX    = BY      = ACXY       E
    ABCDE  = DEX     = ABY     = CXY        E


If we take for each linear function of the yields the alias involving the smallest possible
number of letters, but remembering that x, y are pseudo-factors, so that X, Y and XY are of
equal importance and therefore XY should be regarded as a main effect and not an
interaction, we have the following allocation of contrasts to the three components of the analysis
of variance:

Blocks: X, Y, XY.

Treatments: A, B, C, D, E,
            AB = CX, AC = BX, BC = AX,
            CD = EY, DE = CY, CE = DY,
            AD, BD, AE, BE.

Error: AY, BY, DX, EX, AXY, BXY, CXY, DXY, EXY, ACD, BCD, ACE, BCE.

The four three-factor interactions could equally well be regarded as interactions between
two-factor interactions and blocks. It would be anticipated that these would be smaller
than the interactions of main effects and blocks. The purpose of the present exposition is
to give a clear statement of the possible interpretations of the results of an individual
experiment. Further remarks on the problem of interpretation are postponed to a later
section in the paper.

An example of fractional replication with confounding

A design which has proved of practical utility is the half-replicate of a 2^6 experiment arranged
in four blocks of eight plots.

Call the factors y_1, y_2, y_3, y_4, y_5, y_6. Then the best confounding is that in which, using full
replication, the block differences are all third-order interactions, say

    y_1 y_2 y_3 y_4, y_3 y_4 y_5 y_6 and y_1 y_2 y_5 y_6.

But it is impossible to keep main effects and interactions clear with this confounding,
whatever interaction is equated to the identity.

If we take the confounded interactions to be of the type

    y_1 y_2 y_3, y_3 y_4 y_5 and y_1 y_2 y_4 y_5,

and the interaction y_1 y_2 y_3 y_4 y_5 y_6 be unity, then the following interactions are also
confounded:

    y_4 y_5 y_6, y_1 y_2 y_6 and y_3 y_6.

It will be found by enumeration of the possibilities that one first-order interaction must
be sacrificed. All main effects and the other first-order interactions will have high-order
aliases.

It is interesting to examine this design in the same way as the 2^5 above for the relations
between block-treatment interactions and treatment interactions.

There are, in fact, only thirty-two independent contrasts, and it is simplest to enumerate
these by operating on the identity relationship with the thirty-two possibilities for the 2^5
system omitting y_6. As before, we insert block pseudo-factors. For simplicity of printing we
use A, B, C, D, E, F for the factors and X, Y for the block factors. Then

    I = ABCDEF, X = ABC, Y = CDE, XY = ABDE,

and combining these into one relationship, we have

    I = ABCDEF = ABCX = CDEY = ABDEXY = DEFX = ABFY = CFXY.

A complete table of the aliases for this design follows:

    I      = ABCDEF  = ABCX    = DEFX     = CDEY    = ABFY     = ABDEXY   = CFXY
    A      = BCDEF   = BCX     = ADEFX    = ACDEY   = BFY      = BDEXY    = ACFXY       T
    B      = ACDEF   = ACX     = BDEFX    = BCDEY   = AFY      = ADEXY    = BCFXY       T
    AB     = CDEF    = CX      = ABDEFX   = ABCDEY  = FY       = DEXY     = ABCFXY      T
    C      = ABDEF   = ABX     = CDEFX    = DEY     = ABCFY    = ABCDEXY  = FXY         T
    AC     = BDEF    = BX      = ACDEFX   = ADEY    = BCFY     = BCDEXY   = AFXY        T
    BC     = ADEF    = AX      = BCDEFX   = BDEY    = ACFY     = ACDEXY   = BFXY        T
    ABC    = DEF     = X       = ABCDEFX  = ABDEY   = CFY      = CDEXY    = ABFXY       B
    D      = ABCEF   = ABCDX   = EFX      = CEY     = ABDFY    = ABEXY    = CDFXY       T
    AD     = BCEF    = BCDX    = AEFX     = ACEY    = BDFY     = BEXY     = ACDFXY      T
    BD     = ACEF    = ACDX    = BEFX     = BCEY    = ADFY     = AEXY     = BCDFXY      T
    ABD    = CEF     = CDX     = ABEFX    = ABCEY   = DFY      = EXY      = ABCDFXY     E
    CD     = ABEF    = ABDX    = CEFX     = EY      = ABCDFY   = ABCEXY   = DFXY        T
    ACD    = BEF     = BDX     = ACEFX    = AEY     = BCDFY    = BCEXY    = ADFXY       E
    BCD    = AEF     = ADX     = BCEFX    = BEY     = ACDFY    = ACEXY    = BDFXY       E
    ABCD   = EF      = DX      = ABCEFX   = ABEY    = CDFY     = CEXY     = ABDFXY      T
    E      = ABCDF   = ABCEX   = DFX      = CDY     = ABEFY    = ABDXY    = CEFXY       T
    AE     = BCDF    = BCEX    = ADFX     = ACDY    = BEFY     = BDXY     = ACEFXY      T
    BE     = ACDF    = ACEX    = BDFX     = BCDY    = AEFY     = ADXY     = BCEFXY      T
    ABE    = CDF     = CEX     = ABDFX    = ABCDY   = EFY      = DXY      = ABCEFXY     E
    CE     = ABDF    = ABEX    = CDFX     = DY      = ABCEFY   = ABCDXY   = EFXY        T
    ACE    = BDF     = BEX     = ACDFX    = ADY     = BCEFY    = BCDXY    = AEFXY       E
    BCE    = ADF     = AEX     = BCDFX    = BDY     = ACEFY    = ACDXY    = BEFXY       E
    ABCE   = DF      = EX      = ABCDFX   = ABDY    = CEFY     = CDXY     = ABEFXY      T
    DE     = ABCF    = ABCDEX  = FX       = CY      = ABDEFY   = ABXY     = CDEFXY      T
    ADE    = BCF     = BCDEX   = AFX      = ACY     = BDEFY    = BXY      = ACDEFXY     E
    BDE    = ACF     = ACDEX   = BFX      = BCY     = ADEFY    = AXY      = BCDEFXY     E
    ABDE   = CF      = CDEX    = ABFX     = ABCY    = DEFY     = XY       = ABCDEFXY    B
    CDE    = ABF     = ABDEX   = CFX      = Y       = ABCDEFY  = ABCXY    = DEFXY       B
    ACDE   = BF      = BDEX    = ACFX     = AY      = BCDEFY   = BCXY     = ADEFXY      T
    BCDE   = AF      = ADEX    = BCFX     = BY      = ACDEFY   = ACXY     = BDEFXY      T
    ABCDE  = F       = DEX     = ABCFX    = ABY     = CDEFY    = CXY      = ABDEFXY     T

The partition of the degrees of freedom in the analysis of variance which would generally
be made is the following:

                                     D.F.
    Blocks                             3
    Treatments: Main effects           6
                Interactions          14
    Error                              8
                                      31


The table of aliases is condensed below by the omission of all aliases involving more than
two factors, counting, as before, XY as a single factor as well as X and Y.

Effects A, B, D, E have aliases of at least three letters, but C = FXY and F = CXY.

Effects AD, BD, AE, BE have aliases of at least three letters, but

    AB = CX = FY, AC = BX, BC = AX, CD = EY, EF = DX,
    CE = DY, DE = FX = CY, DF = EX, BF = AY, AF = BY.

In an experiment in which block-treatment interactions cannot be assumed to be negligible
in relation to the effects it is desired to estimate, the interpretation of most two-factor
interactions is difficult if not impossible. The following identities of practical interest exist
for the terms which would be used to estimate the error: ACD, BCD, ACE, BCE have
aliases of three letters and are either three-factor interactions or interactions between blocks
and two-factor interactions, but ABD = EXY, ABE = DXY, ADE = BXY, and
BDE = AXY.
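The partition of the 31 degrees of freedom for the half-replicate of the 2^6 design in four blocks can be recovered by classifying each alias set. The sketch below is ours and uses an assumed rule: a set is assigned to blocks if it contains X, Y or XY, and otherwise to main effects, two-factor interactions or error according to the shortest alias that involves treatment letters only.

```python
from collections import Counter
from itertools import combinations

GENERATORS = ("ABCDEF", "ABCX", "CDEY")      # I = ABCDEF, X = ABC, Y = CDE
BLOCK_WORDS = {frozenset("X"), frozenset("Y"), frozenset("XY")}

def defining_group(generators):
    """All products of the generators; a product is a symmetric difference."""
    group = {frozenset()}
    for g in generators:
        group |= {w ^ frozenset(g) for w in group}
    return group

def classify(alias_set):
    """Assign one alias set to a line of the analysis of variance."""
    if alias_set & BLOCK_WORDS:
        return "blocks"
    shortest = min(len(w) for w in alias_set if not w & frozenset("XY"))
    return {1: "main effects", 2: "two-factor interactions"}.get(shortest, "error")

def partition():
    """Count the alias sets of each kind, one per non-empty subset of ABCDE."""
    group = defining_group(GENERATORS)
    counts = Counter()
    for k in range(1, 6):
        for s in combinations("ABCDE", k):
            alias_set = {frozenset(s) ^ w for w in group}
            counts[classify(alias_set)] += 1
    return counts

if __name__ == "__main__":
    for name, df in partition().items():
        print(f"{name}: {df}")
```

With this rule the counts are 3 for blocks, 6 for main effects, 14 for two-factor interactions and 8 for error; the fifteenth two-factor interaction, CF, falls in the alias set of XY.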
This design is very similar in result to the fully replicated but confounded 2^5 design
described above.


FRACTIONAL REPLICATION IN THE 3^n SYSTEM

Here we have to consider treatment effects assessed from powers of one-third of a complete
replicate. Only those treatment combinations represented by points of the lattice lying on
the hyper-plane

    α_1 y_1 + α_2 y_2 + ... + α_n y_n = 0, or 1, or 2 (mod 3)

will be included in a one-third replicate.

A particular treatment effect is given by the differences between the means of those plots
represented by points on the following three planes:

    β_1 y_1 + β_2 y_2 + ... + β_n y_n = 0 (mod 3),
    β_1 y_1 + β_2 y_2 + ... + β_n y_n = 1 (mod 3),
    β_1 y_1 + β_2 y_2 + ... + β_n y_n = 2 (mod 3).

It is obvious that the points lying on the first plane will also lie on the planes

    (β_1 + λα_1) y_1 + (β_2 + λα_2) y_2 + ... + (β_n + λα_n) y_n = 0 (mod 3), for λ = 1 and 2;

the points on the other two planes will lie on these planes with 1 and 2 respectively on the
right-hand side of the equation.

The aliases of each pair of degrees of freedom are therefore obtained by multiplication of
its symbol by y_1^{α_1} y_2^{α_2} ... y_n^{α_n} and by its square.

As an example, suppose a third replicate of a 3^3 design is based on the inclusion only of
those treatment combinations represented by the symbol y_1 y_2 y_3 (y_1 + y_2 + y_3 = 0 say), then
the aliases are exemplified by the relationship y_1 = y_1 y_2^2 y_3^2 = y_2 y_3.
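That two symbols are aliases in a one-third replicate means that they divide the plots actually included into the same three groups. The sketch below, with function names of our own choosing, checks this for the third replicate of the 3^3 design on the plane y_1 + y_2 + y_3 = 0 (mod 3).

```python
from itertools import product

def value(form, y, p=3):
    """Value (mod p) of the linear form with coefficients `form` at point y."""
    return sum(a * v for a, v in zip(form, y)) % p

def partition(form, points, p=3):
    """Split the points into the classes on which the form is constant."""
    classes = {}
    for y in points:
        classes.setdefault(value(form, y, p), set()).add(y)
    return {frozenset(c) for c in classes.values()}

# The nine treatment combinations with y1 + y2 + y3 = 0 (mod 3).
replicate = [y for y in product(range(3), repeat=3) if value((1, 1, 1), y) == 0]

if __name__ == "__main__":
    main_effect = partition((1, 0, 0), replicate)              # y1
    print(main_effect == partition((1, 2, 2), replicate))      # y1 y2^2 y3^2
    print(main_effect == partition((0, 1, 1), replicate))      # y2 y3
    print(main_effect == partition((0, 1, 0), replicate))      # y2
```

The first two comparisons hold and the third does not: within the replicate y_1, y_1 y_2^2 y_3^2 and y_2 y_3 give the same grouping of plots, while y_2 gives a different one.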

THE CONFOUNDING OF ONE REPLICATE OF A 3^3 EXPERIMENT IN THREE
BLOCKS OF NINE PLOTS

A frequently used design is the 3^3 in three blocks of nine plots, testing all combinations of
three factors each at three levels. This design is formally a one-third replicate of a 3^4 design.
Suppose the factors are y_1, y_2 and y_3 and let blocks be denoted by the pseudo-factor b;
a three-factor interaction of y_1, y_2 and y_3, say y_1 y_2 y_3, is usually confounded in order to keep
main effects and first-order interactions free of block effects.

Then b = y_1 y_2 y_3 or I = y_1 y_2 y_3 b^2, since b^3 = 1.

As in the case of the 2^5 design, we work out the aliases of each pair of degrees of freedom:
each pair of degrees of freedom will in this case have two aliases:

    y_1              = y_1 y_2^2 y_3^2 b      = y_2 y_3 b^2
    y_2              = y_1 y_2^2 y_3 b^2      = y_1 y_3 b^2
    y_3              = y_1 y_2 y_3^2 b^2      = y_1 y_2 b^2
    y_1 y_2          = y_1 y_2 y_3^2 b        = y_3 b^2
    y_1 y_2^2        = y_1 y_3^2 b            = y_2 y_3^2 b
    y_1 y_3          = y_1 y_2^2 y_3 b        = y_2 b^2
    y_1 y_3^2        = y_1 y_2^2 b            = y_2 y_3^2 b^2
    y_2 y_3          = y_1 y_2^2 y_3^2 b^2    = y_1 b^2
    y_2 y_3^2        = y_1 y_2^2 b^2          = y_1 y_3^2 b^2
    y_1 y_2 y_3      = y_1 y_2 y_3 b          = b
    y_1 y_2 y_3^2    = y_1 y_2 b              = y_3 b
    y_1 y_2^2 y_3    = y_1 y_3 b              = y_2 b
    y_1 y_2^2 y_3^2  = y_1 b                  = y_2 y_3 b

Here again the identities could result in difficulty in interpretation, as of course could
have been predicted from the examination of the possible arrangements in blocks of nine
of the 3^4 design. The main effects may be regarded as clear, and three of the first-order
interactions. The remaining two-factor interactions could be ascribed to differential effects
of the factors on the three blocks. The three-factor interactions which are not confounded
with blocks are also ascribable to interactions of main effects and blocks and may therefore
be used to form an estimate of the error of these effects.


GENERAL REMARKS ON CONFOUNDING

The device of confounding is used almost without exception in agricultural experiments in
order to reduce the block size to twelve or fewer plots. As the above results indicate, there are
two aspects which then need careful consideration: (a) the estimation of interactions, and
(b) the estimation of the experimental error.

The main purpose of the factorial design is the estimation of main effects and interactions
between pairs of factors and thence of the effect of any one factor in the presence and absence
of each of the other factors. It is clear that when it is necessary to remove soil heterogeneity
by confounding, the interpretation of a small experiment involving a few factors may be
exceedingly difficult because of the possibility of block-treatment interactions. It is possible
to use the rule that a large contrast should be regarded as the interaction between
whichever pair of main effects is the larger, but this rule will break down in some cases when,
for example, the contrast has two aliases AB and CD, and effects A and C are large and
B and D small. In the case of a series of experiments, a device which might be helpful is the
use of permutations of the possible identity relationships, one at each centre. The modern
emphasis in agricultural experimentation is on series of experiments at various places and
in several years, rather than on individual experiments. Interactions of pairs of factors
will be estimated correctly from a large series of experiments if treatments are assigned
at random to blocks.

The evaluation of two-factor interactions for individual experiments depends on the
assumption that block-treatment interactions are small compared with the experimental
error. Yates (1935) examined several experiments for the existence of such interactions and
found no evidence of them. Since that time a large number of experimental results which
can be used to provide information on the question have been accumulated, and an
investigation of these has indicated that block-treatment interactions are negligible and may be
ignored (Kempthorne, 1947).

With regard to the estimation of error, in so far as tests of significance are of interest, it can
be said that the analysis of variance does provide a test of significance of the hypothesis that
the treatments have an overall effect different from zero. In agricultural experimentation, the
term error is used to denote block-treatment interactions. Thus in the simple randomized
block experiment, it is possible to evaluate the difference between two treatments from each
block, and it is the variability of this difference from block to block which is regarded as the
error. In general, as there are usually few blocks, and the error of each comparison would be
determined with poor accuracy, the errors of all the possible independent comparisons
are pooled to give a common estimate. If the treatments were duplicated at random
within each block, the analysis would be of the form (r being the number of blocks and
t of treatments):

                                 D.F.
    Blocks                       r - 1
    Treatments                   t - 1
    Treatments by blocks         (r - 1)(t - 1)
    Within blocks                rt
                                 2rt - 1

The component 'within blocks' could more accurately be described as experimental
error, but would not be used to evaluate the errors of treatment effects, since the
experimenter is interested in the constancy of treatment effects from block to block. There is
therefore little point in actually carrying out such an experiment. In a factorial experiment
with replication, the components which could be evaluated consist of replicates, effects and
low-order interactions, high-order interactions, and interactions of treatments and
replicates. On the assumption that the sum of squares for interaction of treatments and
replicates is homogeneous, the mean square for high-order interactions will include the mean
square for treatments x replicates plus a component of variance due to high-order
interactions. When only one replication is used, it is assumed that the component of variance due
to high-order interactions is small, and that the high-order interactions mean square can be
regarded as an estimate of error. It is important to bear in mind that an individual
agricultural experiment can give information only for a particular set of experimental conditions
and that it is known from experience that place to place and year to year variability is
considerable. It would therefore be uneconomical to utilize available resources to determine
effects and their errors at a few particular places very accurately, but preferable to sacrifice
replication at each place in order to have information over a large range of experimental
conditions.

Mixed systems 

It is not proposed to examine mixed systems of the type $p^m \times q^n$, where p and q are primes, in 
the present paper. It is clear, however, that the possibilities of complete confounding and 
fractional replication are very limited. A $1/p$th replicate must obviously include $p^{m-1}$ 
combinations of the m factors combined with all the combinations of the n factors. For 
the examination of treatment aliases the system may be regarded as the product of the two 
separate systems. Thus if p = 3, m = 2, q = 2, n = 3 and the factors are 

a half replicate would be obtained by putting I = y'^y'^y'^. The aliases which result are 
exemplified by the following: 

Vi = yiVsyiy^ > ymy'^ = VxvM^ > 

2 / i 2/2 = VxV^y^yW^ > yk = 

Such designs with fractional replication or complete confounding are therefore useful only 
when the corresponding designs for the two separate systems are feasible. 

Comments on ‘The Design of Optimum Multifactorial Experiments’ 

In a paper entitled ‘The Design of Optimum Multifactorial Experiments’, Plackett & 
Burman (1946) put forward designs more specifically for physical and industrial research, 
which are of interest from the point of view of fractional replication. In order to estimate the 
effect of varying nine components of an assembly, each component having two possible 
values, a nominal (-) and an extreme (+), they put forward the following design which 
requires the testing of sixteen assemblies: 



[Table: Plackett & Burman's design for nine components (columns 1 to 9) in sixteen assemblies (rows 1 to 16), each entry being + or -. The individual entries are not recoverable from the source.] 

Yates put forward a similar design in his 1935 paper for the weighing of a number of small 
articles on a balance which required a zero correction, as an example of the estimation of the 
effects of independent factors. In his case there was a close formal analogy to the $2^n$ factorial 
system, and it will now be shown that Plackett & Burman’s design given above is a high-order 
fractional design of the type discussed in the present paper. 

Denoting the nominal values by unity and the extreme values of the nine components by 
a, b, c, d, e, f, g, h, k in order, the treatment combinations represented are 1, aehk, abfk, abcg, 
abcdh, bcdek, acdef, bdefg, acefgh, abdfghk, bceghk, cdfhk, adegk, befh, cfgk, dgh. It is found merely 
by one-by-one examination of the three-factor interactions that all the above sets of treatment 
combinations occur with the same sign in the following: 

ABE, ACK, BCF, CDG, DEH. 

The same will be true for all the members of the Abelian group of which the above five inter¬ 
actions are generators. The identity relationship is therefore: 

$$\begin{aligned}
I &= ABE = ACK = BCEK = BCF = ACEF = ABFK = EFK\\
&= CDG = ABCDEG = ADGK = BDEGK = BDFG = ADEFG = ABCDFGK = CDEFGK\\
&= DEH = ABDH = ACDEHK = BCDHK = BCDEFH = ACDFH = ABDEFHK = DFHK\\
&= CEGH = ABCGH = AEGHK = BGHK = BEFGH = AFGH = ABCEFGHK = CFGHK.
\end{aligned}$$

The identities of interest to the experimenter are the following: 

$$I = ABE = ACK = BCF = EFK = CDG = DEH;$$
from these we derive the following aliases for main effects: 

$$\begin{aligned}
A &= BE = CK, & F &= BC = EK,\\
B &= AE = CF, & G &= CD,\\
C &= AK = BF = DG, & H &= DE,\\
D &= CG = EH, & K &= AC = EF,\\
E &= AB = FK = DH.
\end{aligned}$$
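
The identity relationship and the aliases of the main effects can be checked mechanically. The following is a minimal sketch, assuming the five generators ABE, ACK, BCF, CDG, DEH and the usual rule that a squared letter cancels; the function names `product`, `generate_group` and `aliases` are illustrative and not part of the original paper.

```python
# Sketch: generate the group of defining contrasts from five generators and
# list the two-factor aliases of each main effect. Letters that occur twice
# cancel (A^2 = I), so a product of interactions is a symmetric difference.

def product(u, v):
    """Generalized interaction of two effects written as strings of letters."""
    return "".join(sorted(set(u) ^ set(v)))

def generate_group(gens):
    """All products of the generators, including the identity ''."""
    group = {""}
    for g in gens:
        group |= {product(w, g) for w in group}
    return group

def aliases(effect, group, max_order=2):
    """Aliases of `effect` involving at most `max_order` factors."""
    found = {product(effect, w) for w in group if w}
    return sorted(a for a in found if len(a) <= max_order)

generators = ["ABE", "ACK", "BCF", "CDG", "DEH"]
group = generate_group(generators)

three_factor = sorted(w for w in group if len(w) == 3)
main_aliases = {f: aliases(f, group) for f in "ABCDEFGHK"}

for factor, al in main_aliases.items():
    print(factor, "=", " = ".join(al))
```

The group has 32 members (the identity and 31 interactions), of which six are three-factor interactions; these six are the ones that alias main effects with two-factor interactions.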

In all cases, the contrasts estimating main effects are minus the contrasts estimating 
interactions. If, for example, the interaction of B and E is negative, and A has no effect, the 
conclusion drawn by the experimenter will be that A has a positive effect. It is possible but 
rather difficult to imagine physical systems in which effects will not interact, and interpretation 
of the results of experiments based on this design may often be impossible. With nine 
factors, it appears from the present work that the minimum number of combinations which 
should be tested is 128, that is one-quarter of a replication, though it is possible that by making 
less stringent assumptions about two-factor interactions, one-eighth of a replication might 
give intelligible results. A possible instance in which it might be feasible to use the designs 
discussed is when it is expected that only one or two of the factors have an effect, and the 
problem is to determine as quickly as possible which of the nine factors are responsible. 
An example in which a high-order fractional design was used in such circumstances with good 
results has been described by Tippett (1936). A detailed examination of all the designs put 
forward by Plackett & Burman will not be undertaken, but the lines on which such an 
examination would proceed and the broad conclusions which would emerge are obvious 
from the above examination of one of their simpler designs. 


Conclusions 

A method of examining fractional replication and confounding for some types of factorial 
experiments is described. The formal equivalence between the two is indicated and the 
implications of this equivalence discussed. Further progress will follow on group theory 
lines and this is being examined, together with the possibility of fractional replication when 
the fraction is greater than unity. The possibilities are explored of the estimation of main 
effects and two-factor interactions of many factors by testing only a small proportion of 
the possible treatment combinations. An examination on these lines is made of designs 
proposed by Plackett & Burman. 

A COMPARISON OF STRATIFIED WITH UNRESTRICTED RANDOM 
SAMPLING FROM A FINITE POPULATION* 

By P. ARMITAGE, B.A. 

1. Introduction 

1.1. We are concerned in this paper with the problem of estimating the mean value $\mu$ 
of a variable x in a population, by taking a sample which is in some way representative of the 
population. It has been realized since Bowley’s paper (1926), and more particularly since 
Neyman’s more comprehensive survey (1934), that a certain degree of precision in the 
estimate can often be obtained more economically by stratified random sampling (usually 
referred to merely as stratified sampling) than by unrestricted random sampling (usually 
called merely random sampling). In the stratified method, the population is divided into 
several strata, the sample size divided in some prearranged way among the strata, and 
sampling performed at random from each stratum. In unrestricted random sampling, a 
random selection is made from the whole population, and the method may be regarded as 
a particular case of stratification, where the number of strata is one. 

Some text-books deal briefly with stratified sampling. Wilks (1943) considers only 
infinite populations, and denotes by representative sampling what we should call a particular 
type of stratified sampling (see § 1.2). The subject is treated by Kendall (1946, 
pp. 249-52), but he makes no comparison with unrestricted random sampling. We shall 
begin by introducing several well-known results which will be needed later. 

T • r 

1 *2. The summation sign S will be used throughout for S , and S for S . In general, 

k 

Yi is used for a single summation, for a double summation, and the suffix k where no 

k 

summation is involved. 

We shall consider the following position. A population $\pi$ of size N is subdivided into r 
strata, $\pi_k$, of size $N_k$ ($\sum_k N_k = N$). The variable x is distributed so that the mean and variance 
(divisor $N_k$) within $\pi_k$ are respectively $\mu_k$, $\sigma_k^2$. It is required to estimate $\mu = \sum_k N_k\mu_k/N$, the 
grand mean. 

Suppose a given sample size, n, is divided so that $n_k$ items are sampled at random from 
$\pi_k$ ($\sum_k n_k = n$). We may denote the jth observation from the kth sample by $x_{kj}$ ($j = 1, 2, \ldots, n_k$), 
and the mean and variance of the kth sample by and 5^, which are known to be unbiased 

estimates of and respectively (see, for example, Kendall, 1943, p. 284). 

It seems intuitively obvious to take as our estimate of $\mu$, 

$$m = \sum_k N_k\bar{x}_k/N, \qquad \text{(i)}$$

which is clearly unbiased. This is, however, not the only unbiased estimate which is a linear 
function of the $x_{kj}$. For instance, $\sum_k N_k x_{k1}/N$ also satisfies the conditions. Neyman (1934) 
has shown that, for fixed values of the $n_k$, the estimate given by (i) is the best linear unbiased 
estimate of $\mu$, in the sense that its sampling variance is less than that of any other linear 
unbiased estimate. 

* Communication from the National Physical Laboratory. 

The question now arises: given a sample size n, how shall we choose the $n_k$ so as to minimize 
var(m), where m is given by (i)? Bowley had not considered ‘best’ estimates, and he suggested 
that $n_k$ should be proportional to $N_k$, i.e. 

$$n_k = \frac{nN_k}{N}. \qquad \text{(ii)}$$

Neyman (1934) showed, by the method given in § 2, that the values of $n_k$ which minimize 
var(m) are 

$$n_k = \frac{nN_k\sigma_k'}{\sum_j N_j\sigma_j'}, \qquad \text{(iii)}$$

where $\sigma_k' = \sigma_k\sqrt{[N_k/(N_k-1)]}$. 

We shall refer to these two methods of defining the $n_k$, by (ii) and (iii) respectively, as 
proportionate sampling and optimum stratified sampling, denoting by $m_p$ and $m_o$ the estimates 
of $\mu$ obtained from (i) by the two methods, and by $\bar{x}$ the estimate of $\mu$ given by the mean of 
an unrestricted random sample of n from the whole population $\pi$. 
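
The two allocation rules can be compared numerically. The sketch below is an illustration only: the stratum sizes, standard deviations and sample size are invented, the function names are hypothetical, and the variance formula used is the standard finite-population variance of the stratified mean, written in terms of $\sigma_k' = \sigma_k\sqrt{[N_k/(N_k-1)]}$. Fractional $n_k$ are allowed here for simplicity.

```python
from math import sqrt

def sigma_prime(sigma, N_k):
    """sigma'_k = sigma_k * sqrt(N_k / (N_k - 1))."""
    return sigma * sqrt(N_k / (N_k - 1))

def proportionate(n, N_sizes):
    """Allocation (ii): n_k proportional to N_k."""
    N = sum(N_sizes)
    return [n * N_k / N for N_k in N_sizes]

def optimum(n, N_sizes, sigmas):
    """Allocation (iii): n_k proportional to N_k * sigma'_k."""
    weights = [N_k * sigma_prime(s, N_k) for N_k, s in zip(N_sizes, sigmas)]
    total = sum(weights)
    return [n * w / total for w in weights]

def var_m(n_alloc, N_sizes, sigmas):
    """Variance of m = sum(N_k * xbar_k) / N for a given allocation."""
    N = sum(N_sizes)
    return sum(
        N_k ** 2 * sigma_prime(s, N_k) ** 2 * (1 / n_k - 1 / N_k)
        for n_k, N_k, s in zip(n_alloc, N_sizes, sigmas)
    ) / N ** 2

# Hypothetical population: three strata with unequal spread.
N_sizes = [60, 40, 30]
sigmas = [1.0, 3.0, 6.0]
n = 26

n_p = proportionate(n, N_sizes)
n_o = optimum(n, N_sizes, sigmas)
print("proportionate:", n_p, var_m(n_p, N_sizes, sigmas))
print("optimum:      ", n_o, var_m(n_o, N_sizes, sigmas))
```

With unequal $\sigma_k'$ the optimum allocation puts more of the sample into the more variable strata and gives the smaller variance.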

The optimum stratified method thus requires a knowledge of the $\sigma_k$. In practice, we should 
never know the $\sigma_k$ exactly, unless the population had been subjected to exhaustive sampling, 
in which case $\mu$ would be known exactly. Sukhatme (1936) has shown that, at any rate 
for large $N_k$, if the $\sigma_k$ are estimated from a preliminary sample, and the $n_k$ defined by using 
these estimates in (iii), there is a high probability that var($m_o$) < var($m_p$).* The efficiency 
of this method will of course depend on the size of the preliminary sample, and Sukhatme’s 
investigation only dealt with one value of this (16 from each stratum). In some cases we 
should be able to form a fairly good estimate of the $\sigma_k$ from past experience, and there would 
be no need for a preliminary sample. 

Another interesting comparison which has not been extensively investigated is that 
between optimum stratified sampling and unrestricted random sampling. Wilks (1943) deals 
with this for infinite populations, and obtains (pp. 88, 89) the result (in our notation), 

$$\operatorname{var}(m_o) \le \operatorname{var}(m_p) \le \operatorname{var}(\bar{x}), \qquad \text{(iv)}$$

the first equality holding only when all the $\sigma_k$ are equal, and the second only when all the 
$\mu_k$ are equal. (Our $N_k/N$ are replaced by $p_k$, where $p_k$ is the probability that x, when drawn 
at random from $\pi$, is a member of $\pi_k$, so that, for instance, (iii) becomes 

$$n_k = \frac{np_k\sigma_k}{\sum_j p_j\sigma_j}.$$

Representative sampling as defined by Wilks is what we should call proportionate sampling.) 
We shall show in § 2 that for finite populations, while the relation 

$$\operatorname{var}(m_o) \le \operatorname{var}(m_p) \qquad \text{(v)}$$

is always true, the equality holding only when all the $\sigma_k'$ are equal, it is not necessarily true 
that 

$$\operatorname{var}(m_p) \le \operatorname{var}(\bar{x}), \qquad \text{(vi)}$$

and in fact in the limiting case when all the $\mu_k$ are equal, it is true that 

$$\operatorname{var}(m_p) > \operatorname{var}(\bar{x}), \qquad \text{(via)}$$

so that if the $\sigma_k'$ are also equal 

$$\operatorname{var}(m_o) > \operatorname{var}(\bar{x}); \qquad \text{(vib)}$$

i.e. random sampling gives a more accurate estimate of the mean than any stratified sampling. 
We shall see, however, that in almost all practical cases (iv) is true. 

* No confusion need arise from the fact that the symbol $m_o$ and the term optimum are still used 
when estimates of the $\sigma_k$ are used in (iii). 


2. Derivation of formulae 

2.1. Results (iii) and (v). Using the notation of § 1.2, we have the standard result that 

$$\operatorname{var}(\bar{x}_k) = \frac{N_k-n_k}{N_k-1}\,\frac{\sigma_k^2}{n_k} \qquad \text{(vii)}$$

(see e.g. Wilks, p. 86). Therefore from (i), 

$$\operatorname{var}(m) = \sum_k \frac{N_k^2}{N^2}\,\frac{N_k-n_k}{N_k-1}\,\frac{\sigma_k^2}{n_k}. \qquad \text{(viii)}$$

The result (iii) may be obtained quite easily by finding the values of the $n_k$ which minimize 
(viii) subject to the condition $\sum_k n_k = n$, using the method of Lagrange multipliers. Then, 
substituting (ii) and (iii) in (viii), and applying Schwarz’s inequality, we have (v). The 
following method is due to Neyman. 

It may be verified from (viii) that 


(ix) 


If we denote the three terms of (ix) by A, B and C, so that 

$$\operatorname{var}(m) = A + B - C,$$

it will be seen that A and C are independent of $n_k$ and, since B is non-negative, it follows that 
the values of $n_k$ which minimize var(m) must minimize B. Now B = 0 if and only if 

$$n_k = \frac{nN_k\sigma_k'}{\sum_j N_j\sigma_j'},$$

which is (iii). For these values of $n_k$, $m = m_o$, and 

$$\operatorname{var}(m_o) = A - C. \qquad \text{(x)}$$

If we define $n_k$ by (ii), so that $m = m_p$, we see from (ix) that B = C, so that 

$$\operatorname{var}(m_p) = A. \qquad \text{(xi)}$$

From (x) and (xi), we obtain (v), the equality holding only when C = 0, which is true only 
when $\sigma_k' - \bar{\sigma}' = 0$ for all k, i.e. when the $\sigma_k'$ are all equal. 
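
One algebraic form of the three terms that is consistent with every property stated above can be sketched as follows. This is a reconstruction from (i) to (iii) and from the stated properties of A, B and C, and is not necessarily the form printed as (ix); the weighted mean $\bar{\sigma}' = \sum_k N_k\sigma_k'/N$ is notation assumed here.

```latex
\begin{align*}
\operatorname{var}(m)
  &= \frac{1}{N^2}\sum_k N_k^2\sigma_k'^2\Bigl(\frac{1}{n_k}-\frac{1}{N_k}\Bigr)
   = A + B - C,\\
A &= \frac{N-n}{N^2 n}\sum_k N_k\sigma_k'^2,\\
B &= \frac{1}{N^2}\sum_k \frac{1}{n_k}
     \Bigl(N_k\sigma_k' - \frac{n_k N\bar{\sigma}'}{n}\Bigr)^2 \;\ge\; 0,\\
C &= \frac{1}{Nn}\sum_k N_k\bigl(\sigma_k'-\bar{\sigma}'\bigr)^2 .
\end{align*}
```

With these definitions B vanishes exactly when $n_k = nN_k\sigma_k'/(N\bar{\sigma}')$, and putting $n_k = nN_k/N$ in B gives C, so that the proportionate variance is A and the optimum variance is A - C.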

2.2. Unrestricted random sampling. The variance of a random observation x from $\pi$ is 

$$\sigma^2 = \frac{\sum_k N_k\sigma_k^2}{N} + S,$$

where S is the weighted sum of squares of the $\mu_k$, i.e. $S = \sum_k N_k(\mu_k-\mu)^2/N$. From (vii), 

$$\operatorname{var}(\bar{x}) = \frac{N-n}{n(N-1)}\Bigl[\frac{\sum_k N_k\sigma_k^2}{N} + S\Bigr].$$

From (xi), 

$$\operatorname{var}(m_p) = \frac{N-n}{nN^2}\sum_k \frac{N_k^2\sigma_k^2}{N_k-1}.$$

Denoting $\sum_k N_k\sigma_k^2/(N-1)$ by H, and $\sum_k N_k^2\sigma_k^2/\{N(N_k-1)\}$ by K, we have 

$$\operatorname{var}(\bar{x}) = \frac{N-n}{Nn}\,H + \frac{N-n}{(N-1)n}\,S$$

and 

$$\operatorname{var}(m_p) = \frac{N-n}{Nn}\,K. \qquad \text{(xii)}$$

Now 

$$H - K = -\sum_k \frac{N_k(N-N_k)\sigma_k^2}{N(N-1)(N_k-1)},$$

and if we regard each $N_k$ as being of the same order, O(N), then H - K is $O(N^{-1})$, which means 
that when all the $\mu_k$ are equal, S = 0, and so 

$$\operatorname{var}(\bar{x}) < \operatorname{var}(m_p),$$

which is (via); but as $N \to \infty$, 

$$\operatorname{var}(\bar{x}) \sim \operatorname{var}(m_p) + S/n, \qquad \text{(xiii)}$$

giving Wilks’s result (p. 88) that for infinite populations (vi) is true, the equality holding 
only when all the $\mu_k$ are equal. 

From (v) and (via) it follows that for finite populations, when all the $\mu_k$ are equal and all 
the $\sigma_k'$ are equal, (vib) is true, i.e. in this case unrestricted random sampling is actually 
better than any stratified random sampling with the same sample size. 


3. General comparison 

3 -]. From (ix), (x) aiid (xii), 

-™'I*)-® 

where P = N^'^Nibf — NC^Nic/i)^ > 0 (equality if all cr[ are eqiial), 

As iV’-'’>oo, P, Q and B are xespectiYely 0{N^), 0{N) and 0{N^)j and so we have the result 
that for infinite populations ^^0, which with (xiii) is easily seen to be equivalent to Wilks’s 
result (iv). 

In the finite case, however, by suitable choice of the cr[ and n we can make ^ either positive 
or negative. For instance, if the o-J are all equal and n is sufficiently small, B predominates 
in (xiv), and <^ < 0. As n increases to N, (f> increases to 0. (By considering Q and iJ, it is not 
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obvious that ^ 0 in this case, but it must be remembered that (xiv) is only true if the % 

are given by (iii), and this becomes impossible as n approaches N, This will be remarked 
upon below.) If the crj are sufficiently unequal, P will predominate and $5 > 0 . In this case 


the factor - 


■ (xiv) will be positive, and 56 will decrease as n increases. 

The situations, then, in which (vib) is likely to be true (provided that the $n_k$ are really 
given by (iii)) are when the $\mu_k$ are nearly equal, and when N is small or the $\sigma_k'$ are nearly 
equal. We shall consider some examples in § 4. 


3.2. In applying the procedure of stratification, we shall make two departures from the 
theory outlined above which will tend to nullify the advantages of the stratified method. 
The first is that, as was pointed out in § 1.2, we shall never know the $\sigma_k$ exactly, and the 
degree to which our estimates from which the $n_k$ were obtained are accurate depends on the 
circumstances. It seems quite likely that Sukhatme’s result will be fairly well applicable to 
finite populations, but there is an opportunity for research on this point. 

The second respect in which we depart from theory lies in the fact that, even if the $\sigma_k$ are 
exactly known, the $n_k$ that we choose can never be exactly as given by (iii); first because they 
must be integers, which makes a considerable difference when n is small (the size of the 
smallest stratified sample from which an unbiased estimate of $\mu$ can be made is clearly r); 
and secondly, $n_k$ cannot take values greater than $N_k$. In this latter case, if the values of, say, 
s of the $n_k$, as given by (iii), are greater than the corresponding $N_k$, we should let $n_k = N_k$ 
for these s strata, and then set the other (r - s) values of $n_k$ proportional to the corresponding 
$N_k\sigma_k'$. This will clearly decrease var($\bar{x}$) - var($m_o$) as given by (xiv). For example, when 
n = N, we have 

var {x) = vai (mj = S = Q, 

bub the right-hand side of (xiv) 


(equality holding if all the crj are equal). In fact both these limitations will decrease the 
theoretical advantage (if any) of stratified over random sampling, and we must take them 
into account in assessing the relative merits of the two methods. 


4. Examples 

In the four examples illustrated by Figs. 1-4, var($m_o$) and var($\bar{x}$) have been calculated for 
different stratified populations, and $\psi = \log_{10}\{\operatorname{var}(\bar{x})/\operatorname{var}(m_o)\}$ plotted against c = n/N, 
so that $\psi < 0$ if var($\bar{x}$) < var($m_o$). In each figure the different curves represent populations 
with the same $\sigma_k$, with the $N_k$ in the same proportions but with different magnitudes, and 
with the $\mu_k$ equal, so that S = 0. 

Example 1. $\sigma_k$ = 2, 3, 4; $N_k \propto$ 6, 4, 3 (N = 65, 26, 13). 

Example 2. $\sigma_k$ = 4, 5, 6; $N_k \propto$ 6, 5, 4 (N = 120, 60, 30, 15). 

Example 3. $\sigma_k$ = 4, 5, 6; $N_k \propto$ 3, 11, 4 (N = 126, 54, 36, 18). 

Example 4. $\sigma_k$ = 1, 1, 2, 3, 4; $N_k \propto$ 5, 5, 1, 2, 3 (N = 128, 32). 

The first thing to be noticed about the graphs is that in each one $\psi$ increases, generally 
speaking, as n increases. Further, in any one example the range of c for which $\psi < 0$ increases 
as N decreases; and in this sense we can say that for small samples of proportionate size 
from a stratified population, the advantage (if any) of the stratified method decreases as 
N decreases. 

Secondly, the curves are not smooth. The reason for this is clear. In the optimum stratified 
method the n/i are to be chosen approximately proportional to (a second approximation 

is In Example 1, the are all equal, and it follows that the should ho 

nearly equal. If $n \equiv 0 \pmod 3$ this can be done, but for $n \equiv 1, 2 \pmod 3$, var($m_o$) takes values 
greater than it would if fractional $n_k$ were allowed. This produces a rise in the curve of $\psi$ 
for $n \equiv 0 \pmod 3$, which gradually disappears as n increases since the effect is much greater 
for small n. The same ‘period’ is noticeable in Fig. 2, but in Figs. 3 and 4, where the main 
‘periods’ are respectively 15 and 30, the effect is smaller. 

We saw in § 3.1 that, broadly speaking, the advantage of the stratified method decreases 
as the $\sigma_k$ tend to equality. This is illustrated by comparing Examples 1 and 2. In each of 
these the $N_k\sigma_k$ are equal, but in Example 2 the $\sigma_k$ are proportionally more nearly equal. 
Comparing curves for about the same N (N = 65, 26, 13 in Fig. 1 with N = 60, 30, 15 in Fig. 2), 
we see that in Fig. 2 the range of values of c for which $\psi < 0$ is greater than in Example 1. 
Fig. 3 has the same $\sigma_k$ as Fig. 2, but the $N_k$ and therefore the $N_k\sigma_k$ are different. The 
curves are similar to those of Fig. 2, but the stratified method is still less advantageous 
(especially for small values of c). 

Example 4 has five instead of three strata, and there is quite large variation between the 
$\sigma_k$, and between the $N_k\sigma_k$. There is no doubt here that $\psi > 0$, the only exception being for 
N = 32, n = 5, where $\psi = -0.02$. 

These examples may be said to give the maximum advantage to the stratified method, in 
the sense that the calculated values of var($m_o$) depend on the best method of choosing the 
$n_k$. If the $\sigma_k$ are not sufficiently well known to enable the best values of $n_k$ to be used, then we 
shall get a larger value of var($m_o$). It must be remembered, however, that in all these 
examples we assumed that there was no variation between the $\mu_k$, a situation which would 
be very unlikely to occur in practice. Now it is clear from (xii) that if the same $N_k$ and $\sigma_k$ 
are considered as in one of the above examples, but the $\mu_k$ unequal, the effect is to 
increase the value of var($\bar{x}$) by $(N-n)S/\{(N-1)n\}$, where $S = \sum_k N_k(\mu_k-\mu)^2/N$; so, in any 
example where $\psi < 0$ for some particular values of N and n, we can reverse the direction of 
the inequality by choosing a sufficiently large value of S, say 

$$S > S_0 = [\operatorname{var}(m_o) - \operatorname{var}(\bar{x})]\,(N-1)n/(N-n).$$

In comparing different values of $S_0$ for different examples, it must be remembered that the 
order of magnitude of $S_0$ depends on the $\sigma_k$, and a suitable measure of comparison will be 
$S_0/\sigma_0^2$, where $\sigma_0^2$ is the pooled variance within strata $= \sum_k N_k\sigma_k^2/N$. 

In Example 1, the largest value of $S_0$ is for N = 13, n = 4. Here var($m_o$) = 1.9172, 
var($\bar{x}$) = 1.5577, and $S_0 = 1.917 = 0.231\sigma_0^2$. (If $\mu_1 = 0$, $\mu_2 = 2$, $\mu_3 = 3.6$, then S = 2.066.) 

In Example 2, the largest value of $S_0$ is for N = 15, n = 4. Here var($m_o$) = 24.647, 
var($\bar{x}$) = 19.119, and $S_0 = 28.14 = 0.289\sigma_0^2$. (If $\mu_1 = 0$, $\mu_2 = 7$, $\mu_3 = 13$, then S = 28.51.) 

In Example 3, the largest value of 8^ is for = 18, % = 3. Here var (mo) = 46*236, 
var(«) >=! 30*623, and 8q ~ 63*42 = 0*616o-g, (If/^i = 0, /tg — /*a ~ ^ = 67*6.) 

In Example 4, the largest value of $S_0$ is for N = 32, n = 5 (the only occasion in this example 
where $\psi < 0$). Here var($m_o$) = 0.91406, var($\bar{x}$) = 0.87097, and $S_0 = 0.2474 = 0.049\sigma_0^2$. 
(If “ 9 and /^a = /t 4 = /ifi = 1, then 8 = 0*286.} 
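
The figures quoted for Example 1 can be checked with a short computation. The sketch below is not the author's procedure: it assumes $N_k$ = 6, 4, 3 and $\sigma_k$ = 2, 3, 4 with equal $\mu_k$, uses the standard finite-population variance formulas, and finds the best integer allocation by brute force; all function names are illustrative. Under these assumptions it gives var($m_o$) ≈ 1.9172, var($\bar{x}$) ≈ 1.5577 and $S_0$ ≈ 1.917 for N = 13, n = 4.

```python
from itertools import product
from math import log10

def var_stratified(n_alloc, N_sizes, sigmas):
    """Variance of the stratified estimate m for an integer allocation."""
    N = sum(N_sizes)
    total = 0.0
    for n_k, N_k, s in zip(n_alloc, N_sizes, sigmas):
        s2_prime = s ** 2 * N_k / (N_k - 1)      # sigma'_k squared
        total += N_k ** 2 * s2_prime * (1 / n_k - 1 / N_k)
    return total / N ** 2

def best_integer_allocation(n, N_sizes, sigmas):
    """Search all allocations with 1 <= n_k <= N_k and sum(n_k) = n."""
    ranges = [range(1, N_k + 1) for N_k in N_sizes]
    candidates = (a for a in product(*ranges) if sum(a) == n)
    return min(candidates, key=lambda a: var_stratified(a, N_sizes, sigmas))

def var_random(n, N_sizes, sigmas):
    """Variance of the unrestricted random sample mean when all mu_k are equal."""
    N = sum(N_sizes)
    sigma2 = sum(N_k * s ** 2 for N_k, s in zip(N_sizes, sigmas)) / N
    return (N - n) / (N - 1) * sigma2 / n

N_sizes, sigmas, n = [6, 4, 3], [2, 3, 4], 4
N = sum(N_sizes)

alloc = best_integer_allocation(n, N_sizes, sigmas)
v_o = var_stratified(alloc, N_sizes, sigmas)
v_x = var_random(n, N_sizes, sigmas)
S0 = (v_o - v_x) * (N - 1) * n / (N - n)
psi = log10(v_x / v_o)

print(alloc, round(v_o, 4), round(v_x, 4), round(S0, 3), round(psi, 3))
```

The brute-force search is only practical for small populations such as these; it is used here because the optimum $n_k$ must be integers, which is exactly the source of the irregularities in the curves.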

5. Conclusions 

We have seen in § 3 that optimum stratified sampling may give a less accurate estimate of 
$\mu$ than unrestricted random sampling when the $\mu_k$ are nearly equal, and when N is small or 
the $\sigma_k$ are nearly equal. The examples of § 4 bear out these conclusions and show that the 
effect is greatest for small n, Fig. 3 providing an additional suggestion that if the products 
$N_k\sigma_k$ are widely different the advantage of the stratified method tends to be nullified. In 
practice, we should probably only apply stratified sampling if we knew that the strata were 
sufficiently distinct to ensure considerable variation between either the $\mu_k$ or the $\sigma_k$. In 
the first case, if nothing much was known about the $\sigma_k$ and a preliminary sample on the lines 
suggested by Sukhatme was impracticable, we should use proportionate sampling, and the 
size of S would usually ensure that var($m_p$) < var($\bar{x}$). In the second case, we should use 
optimum stratified sampling, and rely on the variability of the $\sigma_k$ to ensure that 
var($m_o$) < var($\bar{x}$). Since an adequate degree of knowledge about the $\sigma_k$ would be unlikely 
unless the $N_k$ were quite large, we should in this case almost certainly be safe in using the 
method. To the above considerations must be added the fact that if very inaccurate estimates 
of the $\sigma_k$ are used in (iii), then, whatever the nature of the population, the resulting procedure 
may be extremely inefficient. 

It must be realized, of course, that even if it were known that var($m_o$) < var($\bar{x}$), it would 
not follow that the optimum stratified method would necessarily be the most convenient. 
It may be impossible, or at any rate inconvenient, to do any sort of random sampling, and 
some sort of quasi-random sampling may have to be used (see e.g. Madow & Madow, 1944), 
but if the principle of random sampling is applicable the stratified method is not likely to 
be much more inconvenient, and in fact in most cases will be more convenient, than the 
unrestricted method. 

Summary 

The stratified method has been used in the past almost solely for large-scale social and 
agricultural surveys. Here the stratum sizes are large, and known results for infinite populations 
apply. There seems no reason why stratified sampling should not be used to advantage 
for smaller populations, and it is important to know to what extent these results still apply. In 
this paper a comparison has been made with unrestricted random sampling in the usual case 
where we are interested in estimating the mean. The advantages of the stratified method are 
modified, but in most cases where the method is applicable it will be found to be worth while. 

The above work was carried out as part of the research programme of the National 
Physical Laboratory, and this paper is published by permission of the Director of the 
Laboratory. The author desires to acknowledge the assistance rendered by Mr D. V. Lindley 
who prepared the diagrams. 
SOME THEOREMS ON TIME SERIES. I 

By P. A. P. MORAN 

Institute of Statistics, Oxford University 

One of the principal problems in the theory of time series is to discuss the relation between 
two series, and in the present paper we prove a theorem by which we can test whether two 
such series are independent. Such a test of significance must depend on the models which 
we assume for the probability processes which generate the series. In practice, the two most 
useful models are, first, that of a moving average of a series of independent random components 
and, secondly, the solutions of linear stochastic difference equations. 

Let 

$$\eta(t),\ \eta(t+1),\ \ldots$$

be a sequence of independent random variables each distributed in the same distribution 
which we take to have zero mean and its second, third, and fourth moments finite. Then the 
time series generated by 

$$X(t) = \sum_{i=0}^{n} a_i\,\eta(t-i)$$

is a moving average with weights $a_i$. On the other hand, consider a stochastic difference 
equation of the form 

$$X(t) + a_1X(t-1) + \ldots + a_kX(t-k) = \eta(t). \qquad (1)$$

In order that the solution of (1) for successive values of t shall form a stationary series it 
is necessary to impose the condition that the roots of the characteristic equation 

$$z^k + a_1z^{k-1} + \ldots + a_k = 0 \qquad (2)$$

shall all lie inside the circle |z| = 1 (Wold, 1938, p. 63). When this is true the solution of (1) 
can be shown to be of the form 

$$X(t) = \sum_{i=0}^{\infty} \alpha_i\,\eta(t-i),$$

where the $\alpha_i$ are certain functions of the roots of (2). In this case $\sum_{i=0}^{\infty}|\alpha_i|$ is majorized by a 
convergent geometric series. 

Thus we see that both the above models are included in the more general one in which 
we define X(t) as given by 

$$X(t) = \sum_{i=0}^{\infty} \alpha_i\,\eta(t-i),$$

where the $\alpha_i$ are any sequence of constants satisfying $\sum_{i=0}^{\infty}|\alpha_i| < \infty$. Now suppose 

$$\xi(t),\ \xi(t+1),\ \ldots$$

is another sequence of independent random variables having a distribution with zero mean 
and finite second, third and fourth moments. We write 

$$Y(t) = \sum_{i=0}^{\infty} \beta_i\,\xi(t-i),$$

where $\sum_{i=0}^{\infty}|\beta_i| < \infty$. To discuss whether two such empirical series of this form are correlated 
we prove that the covariance 

$$S = \sum_{t=1}^{n} X(t)\,Y(t) \qquad (3)$$

tends, as n increases, to be distributed in the normal form about zero mean with a second 
moment which is a function of the $\alpha_i$ and the $\beta_i$. We shall discuss later the calculation of 
this second moment from empirical series, in which case some care is necessary. 

We first illustrate our method of proof by considering the much simpler problem of determining 
the asymptotic distribution of the sum 

$$\sum_{t=1}^{n} X(t). \qquad (4)$$

We shall show that this asymptotic distribution is also, under certain conditions, normal. 
This result is interesting because it establishes a central limit theorem (and therefore a law 
of large numbers) for stationary stochastic processes of this type. The law of large numbers 
for Markov chains has been considered by several writers, in particular Bernstein (1927), 
who proves his results by using central limit theorems for non-independent components. 
His theorems cannot be applied in the present case, but some of the ideas of his methods can. 
Consider (4) above, where X(t) is defined by 


$$X(t) = \sum_{i=0}^{\infty} \alpha_i\,\eta(t-i)$$

and $\sum_{i=0}^{\infty}|\alpha_i|$ is convergent. There is no loss in generality in supposing that 




Clearly 


2 

i“0 

S S 

3=1 i=0 
as 0, 


Write = o^ = M[X(tn c, = E[X{t)X{l-3)]. 

Then Cj = cr*(«J+afe, 

which are both clearly convergent. Moreover, 

1-1 3=1 

= 4- 2 («- “ i) Cf j . 

tends, as n increases, to = |^Co 4 2 

if this aeries converges absolutely. We shall show that lim nr^R,^ is finite. For Eq is clearly 

( 6 ) 

and this is finite. Moreover, wo notice that is not greater than 


We must now impose the condition that $\sum_{i=0}^{\infty}\alpha_i$ is not zero. This condition is necessary to our 
method of argument. If it is not zero, it may be assumed, without loss of generality, greater 
than a positive number. We now show that as n increases 


tends, uniformly in and to 


'll 


7i-i r 

jft 


er^^di. 


We require the following lemma (Bernstein, 1927, p, 12): 
Lemm^T. Let 

where and tr,, are random variables such that 

- 0 , 

Then if, for n large, pr{tQ{UI^^)^ < 

rt 

tends, uniformly in and to 


then 


TT-i f e~^‘dt, 

Jt, 

P'{ki^^nY<Pn<hW^l 


where Jj^ - ^{pn) tends, uniformly in ^Q and to 


provided that 


TT-if 

Ju 

lim ^ = 0. 

J^r,. 


Let e be an arbitrarily small number and choose N so large that 

iX) CO 

S <eS |«i‘| <e. 

iw*0 


Write 


x^{t) ^ ^ 
<-0 


Then B{r^) = 0, 

and write 

We shall prove that the distribution of tends to normality, i.e. that 


tends, uniformly in and to 






We first calculate and in another way. For 

S X{i-S)= S S 

|f=l 

~ 1) + (oio + a2^)'^(i“2)+ ,,, + (cto+ + 

<o 

+ S (««+“•+ a<f+»-i)#-'9-^)- 

3«1 

and so = M{T\) = tr*|a§ + («o + <^i)^ + ‘-- + (ao + *'* + ^ft-i}®+ («« + •*•+ 0 ^ 1 , 
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and this series converges. On the other hand, 


y; - -1}-I-K+ai) - 2) +.,. + +... + a^_i) ??(t ” 

+■ S (0^0+ 

J>“1 

4-(«!+... +«.jv} i/{< - -1)+... -H - iV^), 

and so - £r2{a§+...(a(j-l-ai)**+“. + (oio+..,+aj^_i)^ 

+ (il ^ N') (<Xq 4" * • • + “I* (®i + ♦ • • "i“ H" • • • + 


Since we have already supposed that $\sum_{i=0}^{\infty}\alpha_i$ is positive, there exist positive numbers $N_0$ and $d$ such that for all $N > N_0$, $\sum_{i=0}^{N}\alpha_i > d$. If $\sum_{i=0}^{\infty}\alpha_i \neq 0$ is not true the theorem is in general false. For suppose the distribution of the $\eta$'s to be non-normal and write $\alpha_0 = 1$, $\alpha_1 = -1$, $\alpha_i = 0$ $(i > 1)$. Then the distribution of $T_n$ does not tend to normality and its variance does not increase with $n$. We shall later show that this condition on the $\alpha$'s is in fact satisfied for the solutions of stochastic difference equations.
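The role of the condition on the sum of the coefficients can be checked numerically for a finite moving average. The following is a minimal sketch, assuming uncorrelated disturbances with common variance `sigma2`; the function name `sum_variance` is illustrative and not part of the original paper. It computes the exact variance of the sum of $n$ consecutive values of the moving average.

```python
def sum_variance(alpha, n, sigma2=1.0):
    """Exact variance of X(t-1) + ... + X(t-n), where
    X(t) = alpha[0]*eta(t) + alpha[1]*eta(t-1) + ...  (finite moving average)
    and the eta's are uncorrelated with variance sigma2 (an assumption)."""
    m = len(alpha)
    # Coefficient attached to each distinct eta in the sum:
    # the convolution of alpha with a run of n ones.
    coeffs = [0.0] * (n + m - 1)
    for s in range(n):
        for i, a in enumerate(alpha):
            coeffs[s + i] += a
    return sigma2 * sum(c * c for c in coeffs)


# Coefficients summing to zero: the sum telescopes and its variance stays fixed.
print(sum_variance([1, -1], 10), sum_variance([1, -1], 1000))
# Coefficients with a non-zero sum: the variance grows roughly in proportion to n.
print(sum_variance([1, 1], 10), sum_variance([1, 1], 1000))
```

With coefficients 1 and -1 the variance does not increase with $n$, which is the failure case described in the text; with coefficients whose sum is not zero it increases steadily.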

Now by the ordinary central limit theorem, as $n$ increases,

n-N 

— S (0^0+ 

JJ-1 

tends to be distributed normally with zero mean and variance 

R'n — (??>—W) («fl . 4- O'**, 

that is pr{i„(2i2;')* < < i^{2R'^Y] 

tends, uniformly in $t_0$ and $t_1$, to $\pi^{-1/2}\int_{t_0}^{t_1} e^{-t^2}\,dt$.


Using Lemma I we see that the same is true, for fixed $N$, when we replace $T'_n$ by $T_n$ and $R'_n$
by ij;. Now „ _ //./ j. ^ 


say, where $Q$ is what we get if we replace the sequence $(\alpha_0, \alpha_1, \ldots)$ in $X(t)$ by $(\alpha_{N+1}, \ldots)$ and alter $t$, and from (5) we can choose $N$ so large that for $n > N$, $n^{-1}E(Q^2) < \epsilon$, say. Taking a sequence $\epsilon_1, \epsilon_2, \ldots$ tending to zero and choosing first $N$ sufficiently large and then $n$ and

using Lemma I again, we see that 

fh 

tends, uniformly in $t_0$ and $t_1$, to $\pi^{-1/2}\int_{t_0}^{t_1} e^{-t^2}\,dt$.


To complete the discussion we must show that the condition we have imposed on the sequence $\alpha_0, \alpha_1, \ldots$ is satisfied by the coefficients of the solutions of stationary stochastic difference equations. Consider an equation

$$X(t) + a_1 X(t-1) + \ldots + a_k X(t-k) = \eta(t)$$

such that the roots of

$$z^k + a_1 z^{k-1} + \ldots + a_k = 0 \qquad (6)$$

all lie inside the circle $|z| = 1$. Then the solution of this equation is given (Wold, 1938, p. 63) by

$$X(t) = \sum_{i=0}^{\infty} \alpha_i\,\eta(t-i),$$

where the $\alpha_i$ are now the solutions of the infinite set of equations

$$\alpha_0 = 1,$$
$$a_1\alpha_0 + \alpha_1 = 0,$$
$$\ldots$$
$$a_k\alpha_0 + \ldots + a_1\alpha_{k-1} + \alpha_k = 0,$$
$$a_k\alpha_1 + \ldots + a_1\alpha_k + \alpha_{k+1} = 0,$$
$$\ldots$$

and since the left-hand side is an absolutely convergent double series, we add, obtaining

$$(1 + a_1 + \ldots + a_k)\sum_{i=0}^{\infty}\alpha_i = 1,$$

and so $\sum_{i=0}^{\infty}\alpha_i \neq 0$ and, as already observed, without loss of generality, may be supposed positive. This quantity is finite because all the roots of equation (6) lie inside the circle $|z| = 1$.
Moreover, it follows that 

^0 ~ “ (14-®!+ 

This is, in fact, proportional to the derivative at zero of the integrated power spectrum 
(Wold, 1938, p. 69). 
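The relation between the coefficients of the difference equation and the sum of the moving-average coefficients can be checked numerically. The following is a hedged sketch: it assumes the sign convention $X(t) + a_1X(t-1) + \ldots + a_kX(t-k) = \eta(t)$ with $\alpha_0 = 1$, and the name `ma_coefficients` is illustrative only.

```python
def ma_coefficients(a, m):
    """First m coefficients alpha_0, ..., alpha_{m-1} of the moving-average
    form of X(t) + a[0]*X(t-1) + ... + a[k-1]*X(t-k) = eta(t).

    Assumed recursion (from the infinite set of equations):
        alpha_0 = 1,  alpha_j = -(a_1*alpha_{j-1} + ... + a_k*alpha_{j-k})."""
    alpha = [1.0]
    for j in range(1, m):
        upper = min(j, len(a))
        alpha.append(-sum(a[i - 1] * alpha[j - i] for i in range(1, upper + 1)))
    return alpha


# Second-order example with both roots inside the unit circle (|z|^2 = 0.3).
a = [-0.5, 0.3]
alpha = ma_coefficients(a, 200)
print(sum(alpha), 1.0 / (1.0 + sum(a)))  # the two values should agree closely
```

The truncated sum of the $\alpha_i$ approaches $(1 + a_1 + \ldots + a_k)^{-1}$, so the sum is finite and not zero whenever the stationarity condition holds.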

We now turn to the problem of discussing the relation between two such series and we
consider the asymptotic distribution of

' CO 

where ■ X(i) = ^ (ai+O), (7) 

1-1 

and (A 4=0). ( 8 ) 

We write in this form rather than that of (3) for the sake of convenience in what follows,
and we have altered the notation of the sums (7) and (8) so that they begin with the
coefficients $\alpha_1$ and $\beta_1$ for the same reason. Writing

c, = EiX{t)X{t-^s)], d, - E{7{i) F(<-5)3 (a - 0,1,,..),. 

as before, we have 

where <jf and cr| are the second moments of 151 and Then 

m)- S ^(X(-f)r(-i))-o, 

i-1 

= £/( I x^-() r»(-{)+2“s "s y(-oy(~(-'sy 

U—I 1»1 if=l i 

= nCodo+2 X («---s)Cadg, 


(9) 
Consider the behaviour of $E(S_n^2)$ as $n$ increases. Clearly

$$n^{-1}E(S_n^2) \to c_0 d_0 + 2\sum_{s=1}^{\infty} c_s d_s = C, \text{ say}, \qquad (10)$$

if the series $C$ is absolutely convergent. If $X$ and $Y$ are moving averages or the solutions of
stationary stochastic difference equations this is certainly true, for in the first case the series
is finite, and in the second it is majorized by a convergent geometric series. We show that it
is true in the general case by the following argument. Without restricting generality, we

CO oa 

may assume, as before, that S ) | < 1, S | AI < h 

<o-f(|ai I+ ...), 

iJO $0 to 

and so <21 ^erf S | A| 


Also 


2 (lAl IAI + -) 


and so $c_0 d_0 + 2\sum_{s=1}^{\infty} c_s d_s$ is finite. We now prove that $C$ is not zero. For

0 = c,(io + 2 S «A = <rHr(«i+al+...)(/«f+/5i+...) + 2 S S i: a„«™+A/5n«l. . 

<(“1 L s«l m=l n=l J 

and after some rearrangement, this equals 

o’lo’lKai +K A+«a A)“+(«i A+“a A+“3 A)‘ + •• -L 

and $(\alpha_1\beta_1)^2$ is greater than zero and the rest non-negative at least. We therefore conclude that


where 


0<C<oo. 


Assuming as before that 21 ^ > 21 I < 


we define N so that 


2 I ^ ® 21 I ^ 

i^+i I ' ' 

C30 CO 

2 I A| <<^2 I A| <e, where e is small. 


We now write 


Xi(t) = 2 
i»l 


5^1(0 - 2 




and consider the sum 


( 16 ) 
We begin by proving that when $n$ is large this sum tends to be distributed in the normal
form with a variance which is asymptotically equal to $nC_1$, where $C_1$ is obtained from $C$ by
putting $\alpha_i = \beta_i = 0$ for $i > N$. For it is then clearly true that

Now consider = S - 0 Pi( ~ t)i 

1 

where is greater than N. For convenience of notation, we write 

We then have S 2 

where the are certain constants. Moreover 

E{71^Q -*0 ah ]■, y, 

mm alU,i, 

= 0 for^ + fc, 

= 0 if or 

It therefore follows that 2 2 Ab. 

^ 1 1 

Inserting (14) and (15) in (16) we have 

if i>n+iV, or or |i-j| >^^“ 1 , 

and 

AT N 

where 2 l 2 2 

with ^ for i>j, 

= .aiAi+ ■ +ai/?j . for i-j. 

We also have Z'g = 

where the sum is taken over values of i mdj such that | i-j | <N, i^n,J^n Mid either 
N<i oi N<j, where 

Aij j—i=tp>0 

= + • • • + ^ “i " p > 0 

— ■h * • • + for i"j-. 

Then m) = 0, jS?{2i) - (r5(r|22^5^, 

where the sum is taken over the above values of i and j. This equals 

(n-N) o-f e-|[(aiy5jv)“ + («i Av-i + «2 Av)^ + * • • 

■f + ••■ "t (*ivA)^]* ( 1 ^) 
We know that $\alpha_1 \neq 0$. Let $\beta_r$ be the first term of the sequence $\beta_N, \ldots, \beta_1$ which is not
zero. Such a term certainly exists. Then the sum in the outer brackets of (17) will contain
a term of the form $(\alpha_1\beta_r)^2$ and consequently $E(Z_1^2) > 0$, and for $N$ fixed will increase as $(n-N)$.


Next we have 


^5 — 


where either 

or 

and 

Then 


i<^nij>n and j~i<Ni 
j^nji>7i and 

= for i-3 = a>>0 

= «Jp-n+-+<^N-pfii> for i-t = 3)>0. 

J!(2s) = 0, E(i:i) = trlfflSSAlp 


(18) 

(19) 


where the sura is taken over the values (18) and (19). 

n+N n+N 

Finally Si =* S S 

where + •«• + for i—j~p>0 

= + for 

" +*‘.+aAr/?iv for i n+p>nf 

n+N n+N 

and i!(S4) = 0, i5(Sj) = <rM S S ,A%. 

i*n+l J*=n+1 

We readily see that $E(Z_i Z_j) = 0$ for $i \neq j$

and therefore "j!(S;») = E{Sl) + BiSI) + B(ZI) + E(2:i). 

I 

Moreover, for constant j®(i^|) and M{Zl) are constant, and so for large n we have 

-> Ojj = o'5cr|[{aj^^)** + {^i^n-i ++* * • 

+ i^iPi + r - + ajv Av)® + • • - + («iv A)®] + fi- (20) 

^ n ' 

Now suppose that N is fixed and consider the sum S Xi(— i) Ti(— t). We write 

1 

n “ m(w+W)+p, 


where|)< 2m+N +1 and n is large enough for m to be greater than N. This equation fixes 
m which increases roughly as when n increases. Write 

( N N+m m*+mN n \ 

s+s+.-.i- s + s )Xi(-t)yi(-t) 

(-1 iV+l lfm{2V+m-l) n~p+l/ 

- K + + Fa++... + +C4, + W. 

ThenT^, .-.jKjt ^ independent and E{Vl)t -®{FJi) are independent of and in 
fact not greater than KN, where Z is a constant independent of N. Also is not 

greater than K{2m-{‘N +1), where K may be taken as the same constant, 
also all independent and £}{Ul^) is asymptotically equal to mG^ when n (and therefore m) 
are large, Therefore, writing 
We have J = 0, = S ^v%

i-l 

m 

M(BJ = 0, JS{5*) = 2 i?(yf)+fi(Ifii), 

i^l 

and the latter increases as m, whilst the former increases as and so 
as n increases. 

By Lemma I it is therefore sufficient to show that the distribution of tends to normality.

Lemma II (Liapounoff's Central Limit Theorem, Bernstein, 1927). If

is the sum of m independent quantities such that 

m 

and if, as m increases, 

r-l 

m 

where = 2 

r*". 1 

then $\mathrm{pr}\{t_0(2B_m)^{1/2} < T_m < t_1(2B_m)^{1/2}\}$

tends, uniformly in $t_0$ and $t_1$, to $\pi^{-1/2}\int_{t_0}^{t_1} e^{-t^2}\,dt$.

To apply the lemma we put 17^ = We already have E{U^) = 0. Also 

by (20), 

and so 

Now consider 

CO CO 

where = S 2 ^pq^r(N-trr-i)+p-i CtV+r— 

3>=1 ff-1 

and the are calculated with $m$ in place of $n$. Since the $\eta$'s all have the same probability
distribution and similarly for the $\zeta$'s, we shall write and for and &<iv+r-j)+<i-i

for the sake of convenience. So we can write the above

CO CO 

^ “ 2 2 -^pqVp^* 

p^l q=i 

Z7J will be a polynomial of the fourth order in the j/’s and the J’s and its expectation may 
be regarded as the sum of two distinct types of terms so that ^ SE{w^’\'SE{w^^ 

where the terms w-^ are of the form and the terms % are of the form 

with $(p, q) \neq (r, s)$. All other terms arising in the product will clearly vanish when the expectation is taken.

Then, since the $A_{pq}$ are bounded and the number of non-zero terms in $w_1$ and $w_2$ are not
greater than $2N(m+N)$ and $4N^2(m+N)^2$ respectively, we have

$$E(U_r^4) < Km^2,$$

where $K$ is a constant depending on $N$ but independent of $m$ and $n$. It follows that

m 

CSC?”' 

1 
is of order $m^{-1}$ and tends to zero as $n$ and $m$ increase. The conditions of the lemma are therefore
satisfied and we conclude that

^A„< t,{2B(Al))t}

tends, uniformly in $t_0$ and $t_1$, to $\pi^{-1/2}\int_{t_0}^{t_1} e^{-t^2}\,dt$.

Applying Lemma I we have

tends, uniformly in $t_0$ and $t_1$, to the same limit.

We now consider the relationship between $S_n$ and $S'_n$. Write

1 1 

=.1 {k (I. - A (,l (I 
= kl 


Li-jv-n 


4" 


s( 

1-1 V 


CO 


. S a-iVi-H 

t-w+i 




= Wi+W,-vW,. 


( 21 ) 

We must now calculate the variance of these terms. Consider again (9). We have shown
(11) that

2 S ^ 2(r5or|(| ydo I +1A1 + • 


< 2or|o'|(| 0^0 I + I fJii I + 

and we now apply this to the three sums in equation (21). It follows that if $N$ be chosen to
satisfy the conditions (12) and (13) then

$$\lim n^{-1}E(W_1^2) < K\sigma_1^2\sigma_2^2\epsilon, \quad \lim n^{-1}E(W_2^2) < K\sigma_1^2\sigma_2^2\epsilon, \quad \lim n^{-1}E(W_3^2) < K\sigma_1^2\sigma_2^2\epsilon^2,$$

where $K$ is a constant independent of $N$.

It follows that $S_n = S'_n + W_1 + W_2 + W_3$,

where the variance of $W_1$, $W_2$ and $W_3$ can be made small compared with that of $S'_n$ by choosing
$N$ large. Then by first choosing $N$ large and then $n$ and using Tchebycheff's inequality, we
see that the distribution of $S_n$ tends to normality with variance $nC$ and this completes
the proof.

In the general application of the above results some care is needed. We can suppose that
our empirical values of $X$ and $Y$ are distributed about their sample means which we take
to be zero and we must estimate the variance of $S_n$ from formulae (9), or (approximately)
from (10). But we must not insert in this formula the sample covariances for the $c_s$ and the
$d_s$ because, as Bartlett (1946) has shown, the standard errors of the sample values of these
covariances are of order $n^{-1/2}$ and we cannot therefore expect the series (10) to converge, let
alone give the correct value. To use the formula correctly we must first decide on the order
and coefficients of the stochastic difference equation which we can suppose generated the
series and, from these coefficients, calculate the value of (9).

In the case where the series are generated by a three-term difference equation, the calculations
are simplified. Suppose the $X$ and $Y$ satisfy the equations

$$X(t+2) + aX(t+1) + bX(t) = \eta(t+2),$$
$$Y(t+2) + AY(t+1) + BY(t) = \zeta(t+2),$$

where $E(\eta(t)) = E(\zeta(t)) = 0$

and $E(\eta^2(t)) = \sigma_1^2$, $E(\zeta^2(t)) = \sigma_2^2$,

as before. For the series to be stationary, we must have $b < 1$, $B < 1$. We suppose that in
addition to this the series are oscillatory and so $a^2 < 4b$, $A^2 < 4B$. The solutions will then be

^0) = E 2(46— 


wherep = 6*, P = 5^, cos^ = -a(26i)”i oos^ *= ™A(25l)“^ Also (Kendall, 1946, p. 408) 


where 

and 


- ^ _ p*sin(s04-_ P*sin(s^+0) 

Ofl sin^ * ® ““ dg"" sinCP ’ 

1 — ^5^ 1 — 

tan^ = j-j-^tan^ and tan0= ^-j-^tan^ 

_ 1+6 J _ 2 1+5 

(i _ ft) {(1 > «0 “ + _ ^ 2 } • 


We then need to calculate 


oO 


I / — 00 ^ 0+-2 2 
8-1 


«3 


= Cg4 l-h2 53 fjP, 




— Cgdifl 


I 4 . V + jr) sin {s^ + 0)) 

sin ifr ain?P 


) 


= co<^oU.+:7; 


2pP foos (^-0 + g-^)-pP cos {\/r-0) 


( 22 ) 


sin ^ sin <P |_ 1 — 2pP cos (^—+p®P^ 

oos(\i^+<P+9+^)-pP cos {jr+^y 
1 — 2pP cos (6+0) + p®P® 

It is probably easiest to calculate $C$ from this equation rather than attempt to simplify
(22) still further. I hope to discuss the practical application of these formulae in another
paper.
RANK CORRELATION BETWEEN TWO VARIABLES, ONE OF
WHICH IS RANKED, THE OTHER DICHOTOMOUS

By J. W. WHITFIELD, Psychological Laboratory, University of Cambridge

Rank correlation is one of the most useful statistical techniques available for the treatment
of data arising in experimental and applied psychological research. Chambers (1946) has
indicated the type of data most frequently occurring in these fields, and has pointed out the
advantages of Kendall's $\tau$ over Spearman's $\rho$ or any form of transformation to ordinal form.

Given the use of $\tau$ when tied rankings are present (Kendall, 1946) it seemed possible to
extend the method to cover a very common problem in psychology, namely, determination
of the relation between two variables, one of which is expressed as a ranking and the other
as a dichotomy. In applied or field work the relation of a psychological 'measurement' and
an external criterion nearly always appears in this form. The usual method of determining
the relationship consists of reducing the ranking to a dichotomy and calculating $\chi^2$ for the
2 x 2 table which results. That this may lead to inaccuracy can be seen from the following
example:

Variable A   1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20

Variable 

Variable + + + + + — 4'-t' + 

Here the data are supposed to be ranked according to variable A and dichotomized into
+ and - with respect to variables B and C.

Treating the relation between variables A and B as a 2 x 2 contingency table:



                                  Variable B
                                  +      -
Variable A:  Rankings 1-10        7      3
             Rankings 11-20       3      7

Applying $\chi^2$, $P$ is found to be 0.074 without Yates's correction for continuity, or 0.180 if
the correction is applied.

But $\chi^2$ is exactly the same for the contingency table relating variables A and C, although
it is obvious from the data that there is considerable difference in the two relationships, the
evidence for which is sacrificed by reducing the ranking to a dichotomy.
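The two probabilities quoted for the 2 x 2 table can be reproduced with a short calculation. This is a minimal sketch, assuming the usual $\chi^2$ statistic for a fourfold table with 1 degree of freedom; the function name `chi2_2x2` is illustrative and does not come from the paper.

```python
import math


def chi2_2x2(a, b, c, d, yates=False):
    """Chi-square (1 d.f.) and its upper-tail probability for the table
        a  b
        c  d
    With yates=True, N/2 is subtracted from |ad - bc| (Yates's correction)."""
    n = a + b + c + d
    diff = abs(a * d - b * c)
    if yates:
        diff = max(diff - n / 2.0, 0.0)
    chi2 = n * diff ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))
    # For 1 degree of freedom, P(chi-square > x) = erfc(sqrt(x / 2)).
    p = math.erfc(math.sqrt(chi2 / 2.0))
    return chi2, p


print(chi2_2x2(7, 3, 3, 7))              # uncorrected
print(chi2_2x2(7, 3, 3, 7, yates=True))  # with Yates's correction
```

The uncorrected statistic is 3.2 and the corrected one 1.8, giving probabilities close to the 0.074 and 0.180 quoted for the table relating A and B.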

If, alternatively, we consider the dichotomous variable as a ranking composed entirely
of two sets of tied rankings, we may calculate the $\tau$ coefficients between A and B, A and C
respectively, which I shall denote by $\tau_{AB}$, $\tau_{AC}$. The corresponding values of $S$ will be found
to be, after the manner described by Kendall (1946):

$$S_{AB} = +70 - 9 + 21 = +82,$$

$$S_{AC} = -30 + 49 - 21 = -2.$$

For the calculation of $\tau$ in the case of tied rankings we have a choice in the denominator
by which $S$ is to be divided to give $\tau$. In the untied case this would be $\tfrac12 n(n-1)$, where $n$ is
the number of ranks. In the tied case we may take the denominator as $\tfrac12 n(n-1)$ or as

$$[\{\tfrac12 n(n-1) - \Sigma\tfrac12 t(t-1)\}\{\tfrac12 n(n-1) - \Sigma\tfrac12 u(u-1)\}]^{1/2},$$

where $t$, $u$, etc., are the extent of the ties. The choice is
determined by practical considerations (see Kendall, 1946), but is not material to a discussion
of significance. For an untied ranking and a dichotomy with 'x' and 'y' members in
each class, the second form reduces to $\{xy \cdot \tfrac12 n(n-1)\}^{1/2}$.

In the case of two untied rankings Kendall has shown that $\operatorname{var} S = \tfrac{1}{18} n(n-1)(2n+5)$.

In the case of one untied ranking and one with ties of extent $t_1$, $t_2$, etc., Sillitto (1947) has
extended this result by proving that

$$\operatorname{var} S = \tfrac{1}{18}\{n(n-1)(2n+5) - \Sigma t(t-1)(2t+5)\}. \qquad (1)$$

In the case of an untied ranking and a dichotomy, $t_1 = x$, $t_2 = y$, $(x + y) = n$, and we have

then the simple form $\operatorname{var} S = \tfrac13 xy(n+1)$. (2)

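In the example above this gives

$$\sqrt{\operatorname{var} S} = 26.46, \qquad \frac{S_{AB} - 1}{\sqrt{\operatorname{var} S}} = 3.06.$$

The probability of a deviation greater than this in absolute value is 0.0022. Further,

$$\frac{|S_{AC}| - 1}{\sqrt{\operatorname{var} S}} = \frac{2 - 1}{26.46} = 0.0378,$$

and the corresponding probability is 0.970.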
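The arithmetic of this example can be retraced as follows. This is a sketch under the assumptions stated in the text: an untied ranking of $n = x + y$ members, a dichotomy into classes of $x$ and $y$, and a continuity correction of 1 (half the interval of 2 between successive values of $S$). The function names are illustrative only.

```python
import math


def var_s_dichotomy(x, y):
    """Equation (2): variance of S for an untied ranking and a dichotomy."""
    n = x + y
    return x * y * (n + 1) / 3.0


def normal_deviate(s, var_s, correction=1.0):
    """Deviate of |S| after subtracting the continuity correction."""
    return (abs(s) - correction) / math.sqrt(var_s)


def two_sided_p(z):
    """Probability of a normal deviation greater than z in absolute value."""
    return math.erfc(z / math.sqrt(2.0))


v = var_s_dichotomy(10, 10)   # 700, square root about 26.46
z_ab = normal_deviate(82, v)  # about 3.06
z_ac = normal_deviate(-2, v)  # about 0.0378
print(math.sqrt(v), z_ab, two_sided_p(z_ab), z_ac, two_sided_p(z_ac))
```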


VARIANCE WHEN THERE ARE TIES IN THE RANKING

The variance of $S$ given by equation (2) is true only in the case of a dichotomy and an untied
ranking. For a tied ranking I surmised from some special cases that

$$\operatorname{var} S = \frac{xy}{3n(n-1)}\{(n^3 - n) - \Sigma(t^3 - t)\}. \qquad (3)$$

In the note following this paper Mr Kendall provides proof of this result.

Example (from data collected by the Medical Research Council team in Germany 1946,
as yet unpublished). Selected workers in a factory were interviewed and an assessment
made of their adaptation to living conditions. They were assessed as 'Efficient' or 'Overactive'.
Other data were available, including statements by the men of frequency of nocturia.
For men aged 50-59 years the following was observed:

Assessment     Rank order of frequency of nocturia
               (least frequent nocturia given highest rank)

Efficient      2½, 2½, 2½, 2½, 6½, 6½, 10, 10, 10, 10, 14, 14
Overactive     5, 10, 14, 16, 17

Five is the highest ranking in the overactive group. Four members of the efficient group
have higher rankings, and eight lower rankings. The $S$ score for that member is therefore
4 - 8. Similarly, for all members we have

$$S = 4 - 8 + 6 - 2 + 10 + 12 + 12 = +34.$$
Using a denominator in the form

$$[xy\{\tfrac12 n(n-1) - \Sigma\tfrac12 t(t-1)\}]^{1/2},$$

$\tau$ is given by

$$\tau = \frac{34}{\sqrt{[(12)(5)\{\tfrac12(17)(16) - \tfrac12(4)(3) - \tfrac12(2)(1) - \tfrac12(5)(4) - \tfrac12(3)(2)\}]}} = \frac{34}{\sqrt{6960}} = 0.41.$$

From (3) we then have

$$\operatorname{var} S = \frac{(12)(5)}{3(17)(16)}\{(17^3-17) - (4^3-4) - (2^3-2) - (5^3-5) - (3^3-3)\} = 344.6.$$


A small problem arises when we consider the correction for continuity to be applied in
testing the significance of an observed value of $S$. In the case of a dichotomy and an untied
ranking the interval between successive $S$ values is 2. In the case of a dichotomy and a
ranking composed entirely of ties of the same extent $t$ the interval is $2t$. But in the example
the ties are of varying extent, and the interval between successive $S$ values is composed of
a mixture of the intervals produced by the successive rank values. Thus, although these
varying intervals are combined so that over most of the range the interval between successive
values is unity, the distribution oscillates somewhat, and to use the value ½ as the correction
for continuity would sometimes be misleading. Further work is required to determine the
correction which will provide a probability on the normal distribution equal to or slightly
greater than the true probability in all cases. Until this is available I propose to use a crude
correction, based on the average of the intervals mentioned above. In the example the successive
rank values 2½ and 5 give an interval of 5 in $S$ score, rank values 5 and 6½ give an
interval of 3, and it is therefore possible to determine the average interval by calculating
the intervals given by successive rank values. This calculation can be shortened. The total
of the $S$ score intervals is twice the number of members, less the extent of the ties involving
the first and last members. If we divide this by the number of intervals between successive
rank values we have the average $S$ score interval. In the example this is $\tfrac16(34 - 4 - 1)$. Using
half of this as the correction for continuity we have

$$\frac{S - 2.42}{\sqrt{\operatorname{var} S}} = \frac{34 - 2.42}{18.56} = 1.70.$$

The pre-observational hypothesis, made on psychological grounds, was that excessive
nocturia is a symptom of inefficient adaptation to living conditions, i.e. a positive correlation
should be obtained. From these observations the probability of a positive correlation as
great or greater than the observed value appearing by chance is 0.044. Direct calculation
of the positive tail of the distribution of $S$ gives a probability of 0.0368.
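The whole of this worked example can be retraced in a few lines. The sketch below assumes the rankings 2½ (four times), 6½ (twice), 10 (four times) and 14 (twice) for the efficient group and 5, 10, 14, 16, 17 for the overactive group, which is the reading consistent with ties of extent 4, 2, 5 and 3 among 17 men; the function names are illustrative and not taken from the paper.

```python
import math
from collections import Counter

efficient = [2.5] * 4 + [6.5] * 2 + [10] * 4 + [14] * 2
overactive = [5, 10, 14, 16, 17]


def s_score(first, second):
    """S: for each member of `second`, count members of `first` ranked
    higher (smaller rank number) minus those ranked lower; ties score 0."""
    return sum((o > e) - (o < e) for o in second for e in first)


def var_s_tied(first, second):
    """Equation (3): variance of S for a dichotomy and a tied ranking."""
    x, y = len(first), len(second)
    n = x + y
    ties = Counter(first + second).values()
    return x * y / (3.0 * n * (n - 1)) * ((n ** 3 - n) - sum(t ** 3 - t for t in ties))


def crude_correction(first, second):
    """Half the average S interval between successive rank values."""
    counts = Counter(first + second)
    values = sorted(counts)
    total = 2 * sum(counts.values()) - counts[values[0]] - counts[values[-1]]
    return total / (len(values) - 1) / 2.0


s = s_score(efficient, overactive)
v = var_s_tied(efficient, overactive)
c = crude_correction(efficient, overactive)
z = (s - c) / math.sqrt(v)
p = 0.5 * math.erfc(z / math.sqrt(2.0))  # one-tailed normal probability
print(s, v, c, z, p)
```

The values obtained are $S = 34$, $\operatorname{var} S \approx 344.6$, a correction of about 2.42, and a one-tailed probability of about 0.044, in agreement with the text.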

The alternative testing hypothesis based on the absolute value of $S$ gives a probability
twice as great, and the corresponding direct calculation using both positive and negative
tails of the actual $S$ distribution gives a probability of 0.0736.

By itself this evidence could only be debatable substantiation of the psychological
hypothesis. In fact, additional data from two other factory groups, treated in the same
way, gave a total $S$ value of +104, the square-root of the total variance being 36.00, providing
a justification of the hypothesis.
THE CASE OF THE 2 x 2 TABLE

If one dichotomous variable can be considered as a ranking with two sets of tied ranks it is
logical to consider the case when both variables are in this form. If we have a 2 x 2 table
in the form

(AB)    (Ab)    (A)
(aB)    (ab)    (a)
(B)     (b)     N

any member of (AB) taken with any member of (ab) has the same order in either ranking
and hence contributes +1 to $S$, and any member of (Ab) with any member of (aB) contributes
-1. The others contribute nothing. Hence

$$S = (AB)(ab) - (Ab)(aB).$$

From equation (3)

$$\operatorname{var} S = \frac{(A)(a)(B)(b)}{N-1}. \qquad (4)$$

Again, for testing the significance of an observed value of $S$ it is necessary to correct for
continuity by subtracting half the interval between successive $S$ values. In the case of the
2 x 2 table the interval is $N$, for if we increase (AB) by unity $S$ becomes

$$\{(AB)+1\}\{(ab)+1\} - \{(Ab)-1\}\{(aB)-1\} = (AB)(ab) - (Ab)(aB) + N.$$

Hence, for the normal deviate, we have

$$\frac{S - \tfrac12 N}{\left[\dfrac{(A)(a)(B)(b)}{N-1}\right]^{1/2}}. \qquad (5)$$

It will be noted that $\tau$ (taking the ties into account in calculating the denominator) is

$$\frac{(AB)(ab) - (Ab)(aB)}{[(A)(a)(B)(b)]^{1/2}},$$

which is the product-moment correlation for a 2 x 2 table when the variables are conventionally
regarded as possessing the discrete values 0, 1.

Testing by use of the normal deviate seems to be moderately accurate, and would appear
to be useful in those cases where $\chi^2$ is suspect because of small expectations in the cells of
the 2 x 2 table. It is less laborious to calculate than the hypergeometric treatment, and is
an alternative form of the approximation to hypergeometric treatment given by Pearson
(1947), who also discusses the order of accuracy of the approximation.

Using the data given earlier as an example, but assuming that it had been possible only
to grade nocturia into 'Normal' or 'Excessive', we have the following table:

Biometrika 34
Assessment          Nocturia
                Normal    Excessive

Efficient         10          2
Overactive         2          3

This gives, after correction for continuity,

$$\frac{S - \tfrac12 N}{\sqrt{\operatorname{var} S}} = \frac{17.5}{15} = 1.167.$$

This gives the probability of $S$ being attained or exceeded in the direction of the hypothesis
(i.e. positive values only) as 0.1217. $\chi^2$ without the continuity correction gives $P$ = 0.0369,*
and with the correction, $P$ = 0.1143. The hypergeometric treatment, summing the probabilities
of obtaining 3, 4 or 5 in the Overactive-Excessive category, gives $P$ = 0.1165.

If the more customary test of absolute value is applied, $\chi^2$ with Yates's correction gives
$P$ = 0.2286, $S$ and the normal deviate gives $P$ = 0.2434, i.e. both values of $P$ are doubled.
The hypergeometric treatment, adding the probability of obtaining 0 in the Overactive-Excessive
category, gives $P$ = 0.2445.

It will be seen that in conditions such as these, $S$ and the normal deviate give a reasonable
approximation to the exact treatment.
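The comparison between the normal deviate and the exact treatment for this table can be checked directly. The sketch below assumes the cell layout (AB) = 10, (Ab) = 2, (aB) = 2, (ab) = 3, with rows Efficient/Overactive and columns Normal/Excessive; the function names are illustrative, and `math.comb` needs Python 3.8 or later.

```python
import math
from math import comb


def s_and_variance(AB, Ab, aB, ab):
    """S and var S for a 2 x 2 table, with the variance from the
    formula var S = (A)(a)(B)(b)/(N - 1)."""
    N = AB + Ab + aB + ab
    A, a = AB + Ab, aB + ab   # row totals
    B, b = AB + aB, Ab + ab   # column totals
    return AB * ab - Ab * aB, A * a * B * b / (N - 1.0)


def corrected_deviate(AB, Ab, aB, ab):
    """Normal deviate with the continuity correction N/2."""
    N = AB + Ab + aB + ab
    s, v = s_and_variance(AB, Ab, aB, ab)
    return (s - N / 2.0) / math.sqrt(v)


def hypergeometric_upper_tail(AB, Ab, aB, ab):
    """Exact probability, with fixed margins, that the (ab) cell is at
    least as large as observed."""
    N = AB + Ab + aB + ab
    a, b, B = aB + ab, Ab + ab, AB + aB
    ways = sum(comb(b, k) * comb(B, a - k) for k in range(ab, min(a, b) + 1))
    return ways / comb(N, a)


z = corrected_deviate(10, 2, 2, 3)
p_normal = 0.5 * math.erfc(z / math.sqrt(2.0))
p_exact = hypergeometric_upper_tail(10, 2, 2, 3)
print(z, p_normal, p_exact)
```

The normal deviate gives about 0.1217 against an exact one-tailed probability of 721/6188, or about 0.1165.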

* This is making the common assumption that $\{(AB)(ab) - (Ab)(aB)\}^2 N/\{(A)(a)(B)(b)\}$ is
distributed as $\chi^2$ with 1 degree of freedom, or that its square root is a normal deviate with sign depending
on the sign of $(AB)(ab) - (Ab)(aB)$.
THE VARIANCE OF $\tau$ WHEN BOTH RANKINGS CONTAIN TIES


By M. G. KENDALL


1. The variance of $\tau$ in the population of sample permutations was given in my paper of
1938 for the case where no tied ranks exist. Mr Sillitto (1947) has given the formula where
one ranking contains ties but the other does not. In the foregoing paper Mr Whitfield has
correctly surmised the variance when one ranking contains ties, and the other is a dichotomy.
In this note I derive the general formula for the variance when both rankings contain ties.
The results of Messrs Sillitto and Whitfield then follow as special cases.

2. I shall follow the method of Daniels (1944). If $a_{ij}$ represents the contribution of the
$i$th and $j$th members of a ranking to $\tau$ we have

$$a_{ij} = +1 \ (i < j), \qquad = 0 \ (i = j), \qquad = -1 \ (i > j). \qquad (1)$$

We write 

where a and b refer to different rankings, and 

(2) 

il 

• (3) 

The quantity $c$ is simply related to $S$ by the relation

$$c = 2S, \qquad (4)$$

and for the testing of $\tau$ it is sufficient to test $c$ or $S$, which are merely constant multiples of $\tau$.
I work with the quantity $c$.


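3. We have, from Daniels's results,

$$\sum_{j=1}^{n} a_{ij} = n + 1 - 2i, \qquad (5)$$

$$\sum_{i,j=1}^{n} a_{ij}^2 = n(n-1), \qquad (6)$$

$$\sum_{i,j,l=1}^{n} a_{ij}a_{il} = \tfrac13 n(n^2-1), \qquad (7)$$

$$E(c) = 0, \qquad (8)$$

$$E(c^2) = \frac{4}{n(n-1)(n-2)}\{\Sigma a_{ij}a_{il} - \Sigma a_{ij}^2\}\{\Sigma b_{ij}b_{il} - \Sigma b_{ij}^2\} + \frac{2}{n(n-1)}\Sigma a_{ij}^2\,\Sigma b_{ij}^2. \qquad (9)$$

If we substitute from (6) and (7) in (9) we find

$$E(c^2) = \tfrac29 n(n-1)(2n+5), \qquad (10)$$

or, equivalently, $$E(S^2) = \tfrac{1}{18} n(n-1)(2n+5), \qquad (11)$$

from which the variance of $\tau$ in the case of untied rankings follows at once.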

4. Now suppose that sets of $t_1, t_2, \ldots$ consecutive members in one ranking are tied. In
place of (6) we then have

$$\sum_{i,j=1}^{n} a_{ij}^2 = n(n-1) - \Sigma t(t-1), \qquad (12)$$

the summation on the right taking place over the various values of $t$. This result follows
simply from the consideration that for a pair of tied ranks $a_{ij} = 0$, and consequently the sum
of squares of contributions from a tied set is of the same form as for the ranking as a whole.

In place of (7) we have

$$\sum_{i,j,l=1}^{n} a_{ij}a_{il} = \tfrac13 n(n^2-1) - \tfrac13\Sigma t(t^2-1). \qquad (13)$$

This is not quite so obvious. Consider a set of tied ranks. The contribution to the sum on
the left of (13) will be unchanged if the suffixes $j$, $l$ fall outside this set. If they both fall
inside, no contribution arises and therefore we have to subtract the term $\tfrac13 t(t^2-1)$. The
remaining possibility is that one falls inside and one outside. In such a case the contribution
remains unchanged in total for it is zero in the original untied case, each possible pair occurring
once to give +1 and once to give -1. Formula (13) follows.


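5. By substitution in (9) we then have, for two rankings with ties typified respectively
by $t$ and $u$,

$$E(c^2) = \frac{4}{n(n-1)(n-2)}\{\tfrac13 n(n-1)(n-2) - \tfrac13\Sigma t(t-1)(t-2)\}\{\tfrac13 n(n-1)(n-2) - \tfrac13\Sigma u(u-1)(u-2)\}$$
$$\qquad + \frac{2}{n(n-1)}\{n(n-1) - \Sigma t(t-1)\}\{n(n-1) - \Sigma u(u-1)\}. \qquad (14)$$

This is the general formula required. We can express it in the alternative form

$$E(c^2) = \tfrac29 n(n-1)(2n+5) - \tfrac29\Sigma t(t-1)(2t+5) - \tfrac29\Sigma u(u-1)(2u+5)$$
$$\qquad + \frac{4}{9n(n-1)(n-2)}\{\Sigma t(t-1)(t-2)\}\{\Sigma u(u-1)(u-2)\} + \frac{2}{n(n-1)}\{\Sigma t(t-1)\}\{\Sigma u(u-1)\}. \qquad (15)$$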


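6. (i) If one ranking is untied, say all the $u$'s are zero, we have Mr Sillitto's result

$$E(c^2) = \tfrac29 n(n-1)(2n+5) - \tfrac29\Sigma t(t-1)(2t+5). \qquad (16)$$

(ii) If one ranking is untied and the other is a dichotomy into $x$ and $n - x = y$ members,
(16) reduces to

$$E(c^2) = \tfrac43 xy(n+1), \quad \text{or, equivalently,} \quad E(S^2) = \tfrac13 xy(n+1), \qquad (17)$$

agreeing with Mr Whitfield's equation (2).

(iii) If one ranking contains ties and the other is a dichotomy we find on substitution
in (14)

$$E(c^2) = \frac{4xy}{3n(n-1)}\{n^3 - n - \Sigma(t^3 - t)\}, \quad \text{or, equivalently,} \quad E(S^2) = \frac{xy}{3n(n-1)}\{n^3 - n - \Sigma(t^3 - t)\}, \qquad (18)$$

agreeing with Mr Whitfield's equation (3).

(iv) Finally, if both variates are dichotomized into $x$, $y$ and $p$, $q$ we find

$$E(c^2) = \frac{4xypq}{n-1}, \quad \text{or, equivalently,} \quad E(S^2) = \frac{xypq}{n-1}, \qquad (19)$$

agreeing with Mr Whitfield's equation (4).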
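The general formula and its special cases lend themselves to a numerical check. The following sketch is written in terms of $S$ (so that $E(S^2) = \tfrac14 E(c^2)$); the tie extents are passed as lists `t` and `u`, untied members may be omitted or entered as 1, and $n \ge 3$ is assumed. The function names are illustrative and not part of the paper.

```python
def var_s_product_form(n, t=(), u=()):
    """var S from the product form (14), divided by 4 because c = 2S."""
    def third(ties):   # (1/3){n(n-1)(n-2) - sum t(t-1)(t-2)}
        return (n * (n - 1) * (n - 2) - sum(k * (k - 1) * (k - 2) for k in ties)) / 3.0

    def second(ties):  # n(n-1) - sum t(t-1)
        return n * (n - 1) - sum(k * (k - 1) for k in ties)

    e_c2 = (4.0 / (n * (n - 1) * (n - 2)) * third(t) * third(u)
            + 2.0 / (n * (n - 1)) * second(t) * second(u))
    return e_c2 / 4.0


def var_s_expanded(n, t=(), u=()):
    """var S from the expanded form (15), divided by 4."""
    def f(k):
        return k * (k - 1) * (2 * k + 5)

    v = (f(n) - sum(f(k) for k in t) - sum(f(k) for k in u)) / 18.0
    v += (sum(k * (k - 1) * (k - 2) for k in t)
          * sum(k * (k - 1) * (k - 2) for k in u)) / (9.0 * n * (n - 1) * (n - 2))
    v += (sum(k * (k - 1) for k in t)
          * sum(k * (k - 1) for k in u)) / (2.0 * n * (n - 1))
    return v


print(var_s_product_form(10))                            # untied: n(n-1)(2n+5)/18 = 125
print(var_s_product_form(20, u=[10, 10]))                # untied ranking and dichotomy: 700
print(var_s_product_form(17, t=[4, 2, 5, 3], u=[12, 5])) # tied ranking and dichotomy: about 344.6
print(var_s_product_form(17, t=[12, 5], u=[12, 5]))      # two dichotomies: 225
```

The four printed values correspond to cases (10), (17), (18) and (19), using the numerical examples of the preceding paper.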
into the $i$th group, and let $m_i$ $(i = 1, 2, \ldots, k)$ be the expected number. It is possible theoretically
for $\chi^2$ to be calculated for the case where

$$\sum_{i=1}^{k} n_i \neq \sum_{i=1}^{k} m_i,$$

but such cases must be rare in statistical practice. We shall overlook this case and will
consider the case where the totals of observed and expected are made equal to one another
with the resultant loss of one degree of freedom in the calculation of $\chi^2$. If the totals agree then

$$\sum_{i=1}^{k} n_i = \sum_{i=1}^{k} m_i \quad \text{and} \quad \sum_{i=1}^{k} \delta_i = 0,$$

where $\delta_i = n_i - m_i$. In order that the sum of these should be zero, at least one of them
must be negative in sign, but which one of these $\delta$'s it will be would seem to be a matter of
chance. It is on this fact that we shall base the first test criterion.

4. Suppose that we have a sequence of $r$ signs of which $r_1$ are positive and $r_2$ negative,
where $r_1 + r_2 = r$, and $r_1 > 0$ and $r_2 > 0$. These signs are postulated to occur in a random order.
Given such a sequence it is easy to record the number of sets of positive and negative signs.
For example, if the sequence is

+ +-+ +-+ - + + + + -, 

then $r = 15$, $r_1 = 9$, $r_2 = 6$, and there are four sets of positive signs and four sets of negative
signs. In general there can be ($\alpha$) $t$ positive, $t$ negative, or ($\beta$) $t$ positive, $t + 1$ negative, or
($\gamma$) $t + 1$ positive and $t$ negative sets of signs. If $T = 2t$ or $2t + 1$ as required, we may ask what
is the probability that given $r_1$ and $r_2$ such a number $T$ of sets (alternately positive and
negative) would have arisen through chance. This probability follows at once from Whitworth,
Choice and Chance, Proposition xxv, viz.: 'The number of ways in which $n$ indifferent
things can be distributed in $t$ different parcels (blank lots being inadmissible) is

$$\frac{(n-1)!}{(t-1)!\,(n-t)!}.\text{'}*$$

5. The total number of ways in which $r_1$ and $r_2$ elements can be arranged is

$$\frac{r!}{r_1!\,r_2!}.$$

We now require to enumerate the number of ways in which $r_1$ can be arranged to form $t$ sets
and $r_2$ to form $t$ sets. To arrange $r_1$ in $t$ sets is equivalent (vide Whitworth) to making $t - 1$
breaks in a sequence of $r_1$ observations, and this may be done in

$$\frac{(r_1-1)!}{(t-1)!\,(r_1-t)!} \text{ ways,}$$

and similarly for $r_2$. It is not specified whether + or - should start the sequence, and hence
the total number of ways in which a sequence $r_1$, $r_2$ may be arranged in $t$ sets each is

$$\frac{2\,(r_1-1)!\,(r_2-1)!}{[(t-1)!]^2\,(r_1-t)!\,(r_2-t)!}.$$

* Since I first thought of this method of attack I have found that the distribution of groups as
given by me in § 5 has already been given by W. L. Stevens, Ann. Eugen., Lond., 9, 10, and by A. Wald
and J. Wolfowitz, Ann. Math. Statist. 11, 147. The probability function has been tabled by F. S. Swed
and C. Eisenhart, Ann. Math. Statist. 14, 66, but it is not in a form that I found suitable for my purposes.
The probability function has been known for many years; what is interesting is the different
uses to which it has been put.
The probability of $2t$ sets will be

$$P\{2t \mid r_1, r_2\} = \frac{2\,(r_1-1)!\,(r_2-1)!}{[(t-1)!]^2\,(r_1-t)!\,(r_2-t)!} \Big/ \frac{r!}{r_1!\,r_2!}, \qquad (1)$$

and the probability of obtaining $(2t+1)$ sets will be

$$P\{2t+1 \mid r_1, r_2\} = \frac{(r_1-1)!\,(r_2-1)!\,(r_1+r_2-2t)}{t!\,(t-1)!\,(r_1-t)!\,(r_2-t)!} \Big/ \frac{r!}{r_1!\,r_2!}. \qquad (2)$$

Hence given $r_1$, $r_2$ and $T$ from a random sequence of positive and negative signs the
probability of such a number of sets having arisen through chance may be calculated.
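The probabilities of the number of sets are easily computed. The following is a minimal sketch, assuming the counting argument of § 5: an even number of sets $2t$ arises in $2\binom{r_1-1}{t-1}\binom{r_2-1}{t-1}$ ways and an odd number $2t+1$ in $\binom{r_1-1}{t}\binom{r_2-1}{t-1} + \binom{r_1-1}{t-1}\binom{r_2-1}{t}$ ways. The function names are illustrative, and `math.comb` needs Python 3.8 or later.

```python
from math import comb


def prob_sets(T, r1, r2):
    """Probability of exactly T sets of like signs in a random order of
    r1 positive and r2 negative signs (r1 > 0, r2 > 0)."""
    if T < 2:
        return 0.0
    total = comb(r1 + r2, r1)
    if T % 2 == 0:
        t = T // 2
        ways = 2 * comb(r1 - 1, t - 1) * comb(r2 - 1, t - 1)
    else:
        t = (T - 1) // 2
        ways = (comb(r1 - 1, t) * comb(r2 - 1, t - 1)
                + comb(r1 - 1, t - 1) * comb(r2 - 1, t))
    return ways / total


def lower_tail(T0, r1, r2):
    """Probability of T0 or fewer sets: small values point to a smooth
    departure from the hypothesis tested."""
    return sum(prob_sets(T, r1, r2) for T in range(2, T0 + 1))


# Example with r1 = 9 positive and r2 = 6 negative signs.
for T in range(2, 14):
    print(T, prob_sets(T, 9, 6))
print(lower_tail(4, 9, 6))
```

The probabilities over all admissible values of $T$ add to unity, which gives a quick check on the enumeration.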

6. It is desired to use the probability of a given arrangement of signs in order to test a
given hypothesis represented by a smooth probability law, bearing in mind that, if the given
hypothesis is not true, then any alternative law is likely to be of a smooth type. Although
no exact definition of a smooth alternative distribution has been made, it may be stated
here that smooth, in the sense used by Neyman, will imply that the number of sets of signs
will be small. For example, if the hypothesis tested is that observations follow a given normal
curve, whereas in fact they have been drawn from a normal distribution identical with the
first but with a smaller mean, then the differences between observation and expectation on
the basis of the hypothesis tested may be expected to give a preponderance of positive
signs below the sample mean and of negative signs above it; that is to say, if the difference
in means is sufficient to offset the sampling fluctuations, we should find a single set of positive
signs followed by a single set of negative signs. If the true population is a normal curve with
the same mean but with a larger standard deviation than that specified by the hypothesis
tested, then there will be a tendency towards a set of positive signs, a set of negative signs,
followed by a set of positive signs, although sampling fluctuations may not leave such a
clear-cut answer. The more complex the alternative hypothesis the less chance there will be
of detecting it.

7. With this objective in view it is proposed to take T, the number of sets of signs, as 
the test criterion, rejecting the hypothesis tested whenever, for a given and T is except 
tionally small. This we do on the grounds that the existence of very few sets of signs suggests 
that the differences between observed and expected frequencies are not due to chauce sam¬ 
pling fluctuations but to some systematic departure of the true probability law (assumed 
smooth) from hypothesis. In following this procedure we should reject the hypothesis if 


2’=. 3 


where is the observed value of T and e the signifloance level selected as appropriate. 
Exact probabilities are given in Table 1, and the application of the test is immediate.* 
There seems to be no reason why the test should not be applicable to both grouped and 
ungrouped observations, although the formulation of the hypothesis tested may be somewhat 
different in the two cases. Consider a sample which has been supposedly randomly 


* An assumption implicit in the test would appear to be that for each cell there is an equal chance 
of obtaining a positive or a negative deviation, that is, that there are sufficient numbers in each cell 
for the binomial to be closely approximated to by a normal curve. An extensive series of random 
sampling experiments has shown, however, that the divergence between theory and practice is not 
significant even when the probability of obtaining a positive is four times that of obtaining a negative. 
Hence while strictly the expectation in each cell of $\chi^2$ should be 10 or over, it would seem that for 
practical purposes the T test may be applied in all cases where the application of the $\chi^2$ test is 
permissible. 



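Table 1. Probability of obtaining a given number of sets, T  [T = 2t or 2t + 1] 

The function tabled is 

$$\frac{2\,(r_1-1)!\,(r_2-1)!}{[(t-1)!]^2\,(r_1-t)!\,(r_2-t)!} \quad\text{or}\quad \frac{(r_1-1)!\,(r_2-1)!\,(r_1+r_2-2t)}{t!\,(t-1)!\,(r_1-t)!\,(r_2-t)!}$$

according as T is even or odd. 

P(T) is obtained by dividing this function by the binomial term $(r_1+r_2)!/(r_1!\,r_2!)$. 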


































drawn from some population. Let the elements of the sample in order of drawing be 
$x_1, x_2, \ldots, x_n$. We may use the T criterion to test the hypothesis of randomness, in the 
following way. If $x_j$ is the smallest value observed in the sample and $x_k$ the largest, then if we 
exclude the trivial case when all the x's are equal it is easy to show that $x_j < \bar{x} < x_k$, where 

$$\bar{x} = \frac{1}{n}\sum_{i=1}^{n} x_i.$$

If we consider the deviations 

$$x_i - \bar{x} = dx_i \quad\text{for } i = 1, 2, \ldots, n,$$

there will be a series $dx_1, dx_2, \ldots, dx_n$, some of which quantities will be positive and some 
negative. The application of the T test is immediate, the admissible alternate hypotheses 
being that if the drawing of the sample is not at random then bias of the smooth kind is 
present. 
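
The probability used in §7 can be computed directly from the function tabled in Table 1. The following is a minimal sketch, assuming that every ordering of the $r_1$ positive and $r_2$ negative signs is equally likely and that both $r_1$ and $r_2$ are at least 1; the function names are illustrative and do not come from the paper.

```python
from math import comb


def ways_with_T_sets(r1, r2, T):
    """Count arrangements of r1 '+' and r2 '-' signs forming exactly T sets."""
    if T < 2 or T > r1 + r2:
        return 0
    t, is_odd = divmod(T, 2)
    if not is_odd:
        # T = 2t: t sets of each sign, and either sign may come first.
        return 2 * comb(r1 - 1, t - 1) * comb(r2 - 1, t - 1)
    # T = 2t + 1: the sign that opens the sequence also closes it.
    return (comb(r1 - 1, t) * comb(r2 - 1, t - 1)
            + comb(r1 - 1, t - 1) * comb(r2 - 1, t))


def prob_T_at_most(r1, r2, T_obs):
    """P{T <= T_obs} when all orderings of the signs are equally likely."""
    total = comb(r1 + r2, r1)
    favourable = sum(ways_with_T_sets(r1, r2, T) for T in range(2, T_obs + 1))
    return favourable / total
```

With 5 positive and 5 negative signs, `prob_T_at_most(5, 5, 2)` is 2/252 and `prob_T_at_most(5, 5, 3)` is 10/252, so the hypothesis would be rejected at level $\epsilon$ whenever the returned value does not exceed $\epsilon$.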

8. As an illustration consider the following two cases: 

Case I. 
Expected frequency    10    25    35    75   155   155    75    35    25    10 
Observation           12    29    45    81   160   146    69    31    20     8 
Deviation              +     +     +     +     +     -     -     -     -     - 

Case II. 
Expected frequency    10    25    35    75   155   155    75    35    25    10 
Observation           12    23    46    60   161   160    69    36    20     8 
Deviation              +     -     +     -     +     +     -     +     -     - 

In the first case $\chi^2$ = … and in the second $\chi^2$ = 6.80; in neither case would the hypothesis 
be rejected as inadequate by using the $\chi^2$ criterion. The T criterion does, however, bring 
out the essential difference: 

Case I.  $r_1 = 5$, $r_2 = 5$, $T = 2$ and $P\{T\} = 0.008$. 

Case II. $r_1 = 5$, $r_2 = 5$, $T = 8$ and $P\{T\} = 0.96$. 

Using the T test we should be inclined to reject the first hypothesis in favour of a smooth 
alternative, while for the second case we should be inclined to agree with the conclusion 
drawn from the $\chi^2$ test that the observational material is adequately described. 
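
The counting of sets in the two cases can be checked by brute force. The sketch below enumerates every placement of the positive signs instead of using the closed formula; the sign strings are transcribed from the deviation rows of the two cases (the ninth sign of Case II is taken as negative because the observed 20 lies below the expected 25), and the function names are illustrative only.

```python
from itertools import combinations, groupby


def count_sets(signs):
    """Number of sets (maximal runs of identical signs) in a sequence."""
    return sum(1 for _ in groupby(signs))


def prob_at_most_by_enumeration(signs):
    """Return (T, P{T' <= T}) by listing every ordering of the same signs."""
    n = len(signs)
    n_plus = sum(1 for s in signs if s == "+")
    observed = count_sets(signs)
    hits = total = 0
    for plus_positions in combinations(range(n), n_plus):
        chosen = set(plus_positions)
        arrangement = ["+" if i in chosen else "-" for i in range(n)]
        total += 1
        if count_sets(arrangement) <= observed:
            hits += 1
    return observed, hits / total


case_1 = "+++++-----"
case_2 = "+-+-++-+--"
```

Under these assumptions Case I gives 2 sets with probability 2/252 and Case II gives 8 sets with probability 242/252, which is the contrast the T criterion is meant to bring out.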

9. Sampling material is available whereby the theoretical distribution of T may be tested 
in practice. Neyman & Pearson (1928) took 208 samples, each of size 200, from a population 
of eight groups described by the cubic curve 

y = 26 + -*^,'r~^a;3, 

The expectation in each cell for a sample of this size was calculated and the $\chi^2$ criterion found 
for each of the 208 samples. The writer was given access to these calculations and was able 
to find the sampling distribution of T from the material. The results of this sampling 
experiment and the theoretical distribution of T from relations (1) and (2) are given in Table 2. 

The agreement between theory and practice would seem to be reasonably good, and in 
the cases (4,4) and (5,3) the values of $\chi^2$, calculated to test the discrepancy between theory 
and practice, were not greater than might be attributable to sampling fluctuations. It was 
not thought worth while to calculate $\chi^2$ for (6,2) and (7,1). A second sampling experiment 
in which samples of size 360 were drawn from a normal population of fifteen groups lent 
further support to the reasonableness of the theoretical distribution. 

10. The T criterion will be a useful supplementary criterion to the $\chi^2$, but because it 
takes account solely of the sign of a deviation and not of its magnitude it will probably 
only be useful when used in conjunction with $\chi^2$. A test of significance which could combine 
both the probability levels of T and $\chi^2$ would undoubtedly be more useful, and we may 
therefore consider how this might be done. Unless the exact degree of dependence which 
exists between two variables is known it is usually only possible to obtain their joint 
distribution if they are independent. It would appear reasonable, both on theoretical grounds 
and from sampling experiments, to assume that T and $\chi^2$ are independent, or, if the 
assumptions underlying both tests are not exactly fulfilled, to assume that the degree of dependence 
between them is at most small. 


Table 2. Comparison of theoretical distribution of T with 
that derived from a sampling experiment 

(4 positive, 4 negative) 
T = number of sets      2      3      4      5      6      7      8   Total 
Sampling                3      5     20     26     23      8      7      93 
Theory                2.7    8.0   23.9   23.9   23.9    8.0    2.7      93 

(5 positive, 3 negative) or (3 positive, 5 negative) 
T = number of sets      2      3      4      5      6      7   Total 
Sampling                2      8     32      …      …      …       … 
Theory                3.6   10.9   29.1   29.1   21.9    7.3     102 

(6 positive, 2 negative) or (2 positive, 6 negative) 
T = number of sets      2      3      4      5   Total 
Sampling                1      3      -      5       9 
Theory                0.6    1.9    3.2    3.2       9 

(7 positive, 1 negative) or (1 positive, 7 negative) 
T = number of sets      2      3   Total 
Sampling                -      2       2 
Theory                0.5    1.5       2 

11. We shall begin by demonstrating that as far as mathematics are concerned the T 
and $\chi^2$ criteria are completely independent.* For simplicity of argument let us consider the 
case of three groups only. The sample may then be represented by a point $(n_1, n_2, n_3)$ in 
three-dimensioned space, with axes of reference $On_1$, $On_2$, $On_3$, and the expected population 
values by a point $(m_1, m_2, m_3)$ in the same space. Since 

$$n_1 + n_2 + n_3 = m_1 + m_2 + m_3 = N,$$

* This method of approach was suggested to me by Andrew Gleason of Harvard University at a 
seminar given at the Statistical Laboratory, University of California, at Berkeley. 
these points are constrained to lie in a plane. Fig. 1 shows this plane for the particular case 
$N = 16$; $m_1 = 4$, $m_2 = 8$, $m_3 = 4$. Since no frequency can be negative, possible sample points 
must be within an equilateral triangle lying in this plane, the chance of occurrence associated 
with a point being the multinomial term 

$$\frac{N!}{n_1!\,n_2!\,n_3!}\left(\frac{m_1}{N}\right)^{n_1}\left(\frac{m_2}{N}\right)^{n_2}\left(\frac{m_3}{N}\right)^{n_3}.$$

When using the $\chi^2$ test the mathematical approximation consists in substituting for this 
term an expression proportional to $e^{-\chi^2/2}$, in regarding this last as a continuous function, and 

Fig. 1. Graphical illustration of the $\chi^2$ contours and the change in signs of the $dn_i$'s. 
$n_1$, $n_2$ and $n_3$ denote the points of intersection of the $On_1$, $On_2$, $On_3$ axes with 
the plane $n_1 + n_2 + n_3 = N$. According to the $\chi^2$ approximation, the chance equals 
$\alpha$ of obtaining a sample point lying outside the elliptic contour on which $\chi^2 = \chi^2_\alpha$. 

in taking as a measure of goodness of fit the integral of this expression outside the ellipse 
which passes through the sample point and on which $\chi^2$ is constant. For the case of three 
groups this integral itself assumes the simple form $e^{-\chi^2/2}$. Three such elliptic contours are 
shown in the diagram. 

Planes through $(m_1, m_2, m_3)$ parallel to the co-ordinate planes will 
intersect the sample plane 

$$n_1 + n_2 + n_3 = N$$

in three straight lines. As shown in the diagram, these lines divide the sample plane into six 
sectors, and for all sample points within a sector the signs of the differences $dn_i = n_i - m_i$ 
will remain unchanged. Any test based solely on runs of signs will consist in taking one or 
more of these sectors as critical regions and rejecting the hypothesis tested when the sample 
point falls therein. It is clear that if we use the mathematical approximation, the distribution 
of $\chi^2$ is the same within each sector; similarly, that the chance of a sample showing a given 
combination of signs is the same on each ellipse along which $\chi^2$ is constant. Thus under the 
assumptions made regarding the distribution of $\chi^2$, the T and $\chi^2$ criteria are completely 
independent. 

In this case of three groups T can only assume values of two or three and the former value 
would not be judged significant, but the argument will follow exactly similar lines in the 
case of many groups. The number of sectors will be in general $2(2r_2 + 1)$ if $r_1 > r_2$ and 
… if $r_1 = r_2$, and they will be bounded by primes passing through the population point. 

12. While the distributions of T and $\chi^2$ are independent for this mathematical model 
they are unlikely to be exactly so when we go back to the true multinomial density 
distribution, because the sample space is neither continuous nor infinite. The model, in fact, 
becomes inaccurate if or are very small For example, it is seen in Fig. 1 that while 
the 1 % ellipse (x - Xooi) completely within tlie triangular space for the sectors with 

signs -1-h and-h, it lies completely without the space for the sector + -l— and 

partly without for the other sectors. It has been thought worth while therefore to test 
whether the two criteria are independent in practice, and to this end the same material 
previously described has been utilized. Tables 3 and 4 give the distribution of mean $\chi^2$ 
for different values of $P\{T\}$ and the distribution of mean $P\{T\}$ for grouped values of $\chi^2$. 
There is little evidence in these figures to show that $P\{T\}$ and $\chi^2$ (and therefore T and $\chi^2$) are 
related. The figures therefore lend support to the geometrical argument and indicate that 
the approximations involved, both from the small sample and the fact that the sample 
space is not infinite, do not invalidate the mathematical result. 

13. In order to combine the $\chi^2$ and T tests of significance it will be necessary to develop 
a theory for the combination of two tests of significance when one criterion is a continuous 
and the other a discontinuous variable. R. A. Fisher has set out the test for the combination 
of tests of significance from a number of independent continuous variables. The keystone 
of the test is the recognition of the fact that if Z is a continuous variable, then z, where 

$$z = \int_{-\infty}^{Z} p(Z)\,dZ,$$

is also a continuous variable equally likely to have any value between 0 and 1; we shall 
describe z as being distributed rectangularly. Minus twice the natural logarithm of the product of two such 
z's, say $z_1$ and $z_2$, where $z_1$ and $z_2$ follow from two independent tests of significance, can be 
shown to be distributed as $\chi^2$ with four degrees of freedom. Consider a discontinuous variable 
X which may take values $X_1, X_2, \ldots, X_s$ and which has an elementary probability law 

$$P\{X = X_j\} = p_j, \quad\text{where } j = 1, 2, \ldots, s$$

and $\sum_{j=1}^{s} p_j = 1$. 

If a new variable, x, is defined as taking values $x_1, x_2, \ldots, x_s$, where 

$$x_k = \sum_{j=1}^{k} p_j,$$
Table 3. Mean $\chi^2$ for different values of $P\{T\}$ 

r1, r2 or r2, r1          …    4,4    5,3    4,4    5,3    6,2    4,4    5,3    4,4    6,2 
No. of obs. on which 
  mean is based          26      6     20     28     30      -     26     32     20      3 
P{T}                   1.00   0.97   0.93   0.89   0.71   0.64   0.63   0.43   0.37   0.29 
Mean chi-squared       7.19   6.04   6.87   6.43      …      -   7.41   7.32   6.82   4.86 

r1, r2 or r2, r1        7,1    5,3    4,4    6,2    5,3 and 4,4 
No. of obs. on which 
  mean is based           -      8      5      1    5 
P{T}                   0.25   0.14   0.11   0.07    0.04 and 0.03 
Mean chi-squared          -   6.10   6.53   0.96    8.06 


Table 4. Mean $P\{T\}$ for grouped $\chi^2$ 

chi-squared            0.0-1.0  1.0-2.0  2.0-3.0  3.0-4.0  4.0-5.0  5.0-6.0  6.0-7.0  7.0-8.0  8.0-9.0  9.0-10.0 
No. of obs. on which 
  mean is based              1        7       14       27       32       29       21       14       11        15 
Mean P{T}                 0.71     0.73     0.69     0.60     0.68     0.60     0.66     0.67     0.67      0.64 

chi-squared            10.0-11.0  11.0-12.0  12.0-13.0  13.0-14.0  14.0-15.0  15.0-16.0  16.0-17.0  17.0-18.0 
No. of obs. on which 
  mean is based               10         10          6          4          2          1          2          1 
Mean P{T}                   0.74       0.66       0.74       0.68       0.82       1.00       0.63       0.63 


then $x_k$ may only take values between 0 and 1 for $k = 1, 2, \ldots, s$. It is required to find the 
joint probability law of the product of two independent variables x and z, where x and z 
are as defined above. It will be noted that the elementary probability law of x will be 

$$P\{x = x_j\} = p_j \quad (j = 1, 2, \ldots, s).$$

Hence when $x = x_j$ (the probability of which is $p_j$), the product xz will be distributed 
rectangularly between 0 and $x_j$ on a proportion $p_j$ of occasions. It follows that xz has a probability 
distribution which has points of discontinuity at $x_1, x_2, \ldots, x_s$, is distributed 
rectangularly between these points of discontinuity, and that 

$$P\{0 < xz < x_1\} = x_1 \sum_{j=1}^{s} \frac{p_j}{x_j}.$$

Generally 

$$P\{0 < xz < x_k\} = x_{k-1} + x_k \sum_{j=k}^{s} \frac{p_j}{x_j}.$$
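
The probability integral of the product can be written down for any argument, not only at the points of discontinuity. A minimal sketch follows, assuming only what is stated above: x takes the cumulative values $x_j$ with probabilities $p_j$, and z is rectangular on (0, 1) and independent of x. The function name `product_cdf` is illustrative.

```python
def product_cdf(y, probs):
    """P{xz <= y} for 0 < y <= 1.

    probs holds p_1, ..., p_s; x takes the value x_j = p_1 + ... + p_j with
    probability p_j, and z is uniform on (0, 1), independent of x.
    """
    total = 0.0
    x_j = 0.0
    for p in probs:
        x_j += p
        if p > 0:
            # Given x = x_j, the product xz is uniform on (0, x_j).
            total += p * min(1.0, y / x_j)
    return total
```

For `probs = [0.5, 0.5]` the attainable values of x are 0.5 and 1, and `product_cdf(0.5, [0.5, 0.5])` returns 0.75, which agrees with $x_1 + x_2\,p_2/x_2$ evaluated at $y = x_1$.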
14. If we now apply this theory to the combination of the tests of significance of T and 
$\chi^2$, it is seen that we must consider the product of $P\{\chi^2\}$ and $P\{T\}$. $\chi^2$ is a continuous 
variable and 

$$z = \int_{\chi_0^2}^{\infty} p(\chi^2)\,d(\chi^2) = P\{\chi^2\}$$

is distributed rectangularly between 0 and 1, and 

$$x = \sum_{T=2}^{T_0} P\{T\}$$

is a discontinuous variable taking known values. The probability integral of xz is thus known 
from theory and $\gamma_{0.05}$ or $\gamma_{0.01}$ can be found to satisfy the relation 

$$P\{0 < xz < \gamma_\epsilon\} = \epsilon.$$

These probability levels are given in Table 5. The procedure for the joint test of significance 
will be: 

(i) calculate $P\{T\}$ as described in §7; 

(ii) calculate $P\{\chi^2\}$ in the usual way. The degrees of freedom will be the number of groups 
minus one; 

(iii) multiply $P\{T\}$ and $P\{\chi^2\}$ together and refer to Table 5 to judge the significance of 
the product. 
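
The three steps, and the critical values of the product, can be sketched as follows. This is an illustration under the assumptions of §13 and §14 (T and the continuous criterion independent, all orderings of signs equally likely, $r_1$ and $r_2$ both at least 1), not the computation actually used for the table; all function names are hypothetical.

```python
from math import comb


def set_probabilities(r1, r2):
    """P{T = 2}, P{T = 3}, ..., P{T = r1 + r2} for r1 '+' and r2 '-' signs."""
    total = comb(r1 + r2, r1)
    probs = []
    for T in range(2, r1 + r2 + 1):
        t, is_odd = divmod(T, 2)
        if is_odd:
            ways = (comb(r1 - 1, t) * comb(r2 - 1, t - 1)
                    + comb(r1 - 1, t - 1) * comb(r2 - 1, t))
        else:
            ways = 2 * comb(r1 - 1, t - 1) * comb(r2 - 1, t - 1)
        probs.append(ways / total)
    return probs


def cumulative(probs):
    """Running sums x_k = p_1 + ... + p_k."""
    levels, running = [], 0.0
    for p in probs:
        running += p
        levels.append(running)
    return levels


def joint_significance(p_continuous, r1, r2, T_obs):
    """Steps (i)-(iii): return P{T <= T_obs} and the chance of a smaller product."""
    probs = set_probabilities(r1, r2)
    levels = cumulative(probs)
    p_T = levels[T_obs - 2]
    product = p_continuous * p_T
    joint = sum(p * min(1.0, product / x)
                for p, x in zip(probs, levels) if p > 0)
    return p_T, joint


def critical_product(eps, r1, r2):
    """Solve P{xz < gamma} = eps for gamma by inverting the piecewise-linear law."""
    probs = [p for p in set_probabilities(r1, r2) if p > 0]
    levels = cumulative(probs)
    below = 0.0  # x_{k-1}, with x_0 = 0
    for k, x_k in enumerate(levels):
        slope = sum(p / x for p, x in zip(probs[k:], levels[k:]))
        gamma = (eps - below) / slope  # on this piece F(y) = x_{k-1} + y * slope
        if gamma <= x_k:
            return gamma
        below = x_k
    raise ValueError("eps must lie strictly between 0 and 1")
```

For example, `joint_significance(0.10, 9, 6, 6)` returns a value of $P\{T\}$ near 0.175 and a joint probability near 0.066, and `critical_product(0.05, 5, 5)` is close to 0.0143.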


Table 5. Values of $\gamma_{0.05}$ and $\gamma_{0.01}$: $P\{P(\chi^2)\,P(T) < \gamma_\epsilon\} = \epsilon$ 

This table may be used to judge the significance of the joint distribution 
of the T criterion and any other continuous criterion. 

  r    r1   r2    gamma(0.05)   gamma(0.01) 
  5     4    1         …         0.00625 
        3    2         …         0.0047 
  6     5    1         …         0.0060 
        4    2         …         0.0042 
        3    3         …         0.0039 
  7     6    1      0.0292       0.0058 
        5    2      0.0197       0.00395 
        4    3      0.0174       0.0035- 
  8     7    1      0.0286          … 
        6    2      0.0188          … 
        5    3      0.0160          … 
        4    4      0.0153          … 
  9     8    1      0.0281       0.0056 
        7    2      0.0180       0.0036 
        6    3      0.0151       0.0031 
        5    4      0.0140       0.0028 
 10     9    1      0.0278       0.0056 
        8    2      0.0175       0.0035 
        7    3      0.0143       0.0029 
        6    4      0.0143       0.0026 
        5    5      0.0143       0.0025 
 11    10    1      0.0275+      0.0055+ 
        9    2      0.0171       0.0034 
        8    3      0.0144       0.0028 
        7    4      0.0144       0.0025 
        6    5      0.0140       0.0024 
 12    11    1      0.0273       0.0055- 
       10    2      0.0168       0.0034 
        9    3      0.0149       0.0027 
        8    4      0.0142       0.0024 
        7    5      0.0136       0.0022 
        6    6      0.0131       0.0021 
 13    12    1      0.0271       0.0054 
       11    2      0.0165+      0.0033 
       10    3      0.0151       0.0026 
        9    4      0.0138       0.0023 
        8    5      0.0137       0.0022 
        7    6      0.0138       0.00225 
 14    13    1      0.0269       0.0054 
       12    2      0.0163       0.0033 
       11    3      0.0151       0.0025+ 
       10    4      0.0135-      0.00225 
        9    5      0.0138       0.0023 
        8    6      0.0130       0.0022 
        7    7      0.0134       0.0022 
15. The application of the joint test of significance may be illustrated by means of an 
example. A sample of 360 observations is available. This sample has actually been randomly 
drawn from a normal population of which the mean is zero and the standard deviation unity. 
The figures are given in Table 6. Calculations give $\chi^2 = 21.1$ and $P\{\chi^2\} = 0.10$. Judging 
by the $\chi^2$ alone we should say probably that there is nothing out of the ordinary in the 
deviations of the sample from the expected values. The number of signs is 15, of which 9 
are positive and 6 negative, and these are arranged in six sets. Making the appropriate 
calculations, we have 

$$P\{6 \text{ sets} \mid 9 \text{ positive}; 6 \text{ negative}\} = 0.175.$$

The arrangement of signs will therefore be judged as acceptable. The joint significance of a 
$P\{\chi^2\} = 0.10$ and a $P\{T\} = 0.175$ is found, by evaluating the joint distribution, to be 0.066. 


Table 6. Sample values. Observed and expected 

Central values   -2.1 and under   -1.8   -1.5   -1.2   -0.9   -0.6   -0.3    0.0 
Observation                  12     10     18     26     23     42      …     49 
Expectation                 9.3    8.6   14.0+     …   28.7   35.9      …   43.0- 
Deviation                  +2.7   +1.4   +4.0   +6.0   -5.7   +6.1   +2.0   +6.0 

Central values   +0.3   +0.6   +0.9   +1.2   +1.5   +1.8   +2.1 and over   Total 
Observation         …      …     20      …      …      3               5     360 
Expectation         …      …   28.7      …      …    8.6             9.3     360 
Deviation        -6.0   -7.9   -8.7   +6.0   +6.0   -5.6            -4.3       0 
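
The expectation row of such a table follows from the normal probability integral. A minimal sketch is given below, assuming fifteen groups of width 0.3 centred at -2.1, -1.8, ..., +2.1, with the two end groups open, and a sample of 360 from a normal population with zero mean and unit standard deviation; the grouping limits are inferred from the central values and are an assumption, as are the function names.

```python
from math import erf, sqrt


def normal_cdf(z):
    """Standard normal probability integral."""
    return 0.5 * (1.0 + erf(z / sqrt(2.0)))


def expected_frequencies(n, first_centre, width, n_groups):
    """Expected counts in equal-width groups; the two end groups are open."""
    expected = []
    for g in range(n_groups):
        centre = first_centre + g * width
        lower = normal_cdf(centre - width / 2) if g > 0 else 0.0
        upper = normal_cdf(centre + width / 2) if g < n_groups - 1 else 1.0
        expected.append(n * (upper - lower))
    return expected


expected = expected_frequencies(360, -2.1, 0.3, 15)
```

Under these assumptions the groups centred at -1.8 and -0.9 have expectations of about 8.6 and 28.7, and the fifteen expectations add to 360.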


16. A study of the basic table (Table 1) of the function T will show that $P\{T\}$ is not a 
very sensitive criterion with which to judge the randomness of a sequence of signs unless 
the number of groups under consideration is very large. For example, if there are 10 signs, 
5 of which are positive and 5 of which are negative, the probability of getting two sets of 
signs is 0.008. Thus the test would show, and rightly, that the chance of such an arrangement 
is small, but this fact would undoubtedly be recognized by a skilled computer without the 
use of a test at all. In the case of 10 signs the probability of three groups or less is 0.040, 
and this would possibly be judged non-significant. Again, let us consider an extreme case, say 
10 signs, 9 of which are positive and 1 negative. The T criterion does not concern itself with 
the fact that the numbers 9 and 1 are exceptional; it is merely concerned with deciding whether 
their arrangement is exceptional given the 9 and 1. Table 1 shows that neither possible 
arrangement would be considered out of the ordinary. It is these points of weakness which 
show that the criterion T is not of great utility except in combination with $\chi^2$. For, if we 
consider the 9 positive, 1 negative case, common sense tells us that the $\chi^2$ criterion in such 
a case would possibly be significant. Nine positive deviations have to be balanced by a 
single negative deviation, and this last is therefore likely to be big. This does not influence T; 
neither will the contribution of T to the joint criterion be of much weight. This is as it should 
be, for it is difficult to see how one can postulate a smooth alternative for 9 positive, 1 
negative, two sets, and not also for 9 positive, 1 negative, three sets. Generally, however, we 
shall not meet such extreme cases in practice. One way of overcoming this weakness of the 
test would be to consider the probability of obtaining $r_1$ positive and $r_2$ negative signs together 
with the probability of obtaining T sets of alternate positive and negative signs given $r_1$ and $r_2$. 
This is simple enough when considering just a sequence of alternatives, as I have shown 
elsewhere, but it is not easy to fit these results to the $\chi^2$ problem, nor, when this is possible, 
will the choice of a critical region be straightforward. However, the results of sampling 
experiments will be utilized to throw light on these points and it is hoped to discuss them, 
with other questions arising, in a further publication. 

17. It is possible that there are other criteria, depending on the arrangement of positive 
and negative signs, which will be more sensitive than the T criterion chosen. For example, 
it is easy to calculate, given $r_1$ and $r_2$, the probability that the largest set is composed of a 
sequence of r' positive signs, and there are many other possibilities which might be 
considered. It would appear that any criterion based on sign sequences can be shown to be 
independent of $\chi^2$ by means of geometrical argument, and it will be necessary therefore to 
consider the power of these different sign tests when referred to a specified set of alternate 
hypotheses. 

18. The main objection to the two criteria, T and $P\{\chi^2\}\,P\{T\}$, that I have proposed in 
this note is the one which was mentioned earlier; they are only applicable to the case where 
there is just one restriction on $\chi^2$, i.e. when the totals of expected and observed frequencies 
have been made to agree. It is possible to work out a slightly different form of the T criterion 
for each additional restriction which is put on $\chi^2$, and this has been done. It is preferable, 
however, to delay publication until the results of an extensive sampling experiment are 
complete in order to verify whether such theoretical assumptions as have been made are 
reasonable. 
AN EXACT TEST FOR THE EQUALITY OF VARIANCES* 

By R. L. PLACKETT, M.A. 

Introduction 

The problem of testing the equality of variances and covariances in normal distributions 
is one which has received considerable attention; we have compiled a bibliography of some 
sixty papers, and shall issue a survey of these in due course; only papers vital to our discussion 
will be considered here. A precise instance of the type of situation we are considering is as 
follows: measurements of height, span and tibia length are made on each of 20 Englishmen, 
20 Scotsmen, 20 Welshmen and 20 Irishmen; it is required to know if the covariance matrix 
of the three characteristics is the same for each of the four nationalities. Nothing is known 
or assumed about the mean values of these characteristics in the four populations 
considered, nor are we interested in testing any hypothesis concerning the means, although 
such a hypothesis may be the object of further investigations which assume that the four 
covariance matrices are the same; this latter assumption is inevitably made in multivariate 
analysis of variance. 

Wilks (1932) has already given the moments of the distribution of his criterion for testing 
the equality of several covariance matrices (on the hypothesis that the matrices are in fact 
equal) and Bishop (1939) put this criterion into an approximate workable shape. The test 
criterion given here differs from that of Wilks and has the advantage when one or two 
correlated characteristics are being measured (height or height and span, for example) that its 
distribution is exactly known whatever the number of populations. Nair (1939) did, it is 
true, give the exact distribution of the Neyman & Pearson (1931) criterion for one 
measured characteristic; and the exact distribution for two characteristics of Wilks's 
generalization of their criterion; but the form in which the distribution was obtained is very involved. 
It is interesting to notice that from our standpoint the problem of testing the equality of 
several variances (i.e. the case of one measured characteristic) is, as will appear, brought 
within the framework of multiple correlation theory. In the general case of more than two 
characteristics the moments of the distribution of our criterion, like those of Wilks, are 
available. 

Outline of method 

In the usual terminology we consider k p-variate normal distributions and are concerned 
with testing the hypothesis that the corresponding variances and covariances are all equal. 
The method we employ to test this hypothesis is essentially that which has been in use in 
analysis of variance since its origination by Fisher; to test the equality of a set of k quantities 
we test whether (k - 1) orthogonal linear functions of the quantities are each zero. To 
illustrate the application of this principle in the present instance take the particular case p = 1, 
i.e. we wish to test the equality of the variances in k univariate normal distributions. If 
a typical observation from the lth distribution is $t_l$ ($l = 1, 2, \ldots, k$), form k mutually 
orthogonal linear functions of the $t_l$ such that one is 

$$u = t_1 + t_2 + \ldots + t_k.$$

* Communication from the National Physical Laboratory. 
If the (k - 1) covariances of u and each of the other linear functions are all zero then the 
variances of the k distributions must all be equal; this condition may be expressed by saying 
that the multiple correlation coefficient of u on the other linear functions is zero. Further, 
if there are n sample values of u then the size of sample drawn from each distribution must 
also be n at least, and if no observations are to be discarded the size of each sample must be 
n exactly. Thus, although it is not a condition of the problem that the sizes of samples drawn 
from the k distributions must all be equal, it is a condition of our solution. 

The extension of the foregoing principle to p > 1 is straightforward and is considered in 
detail in the next section; the problem then becomes that of testing the independence of two 
groups of variates, the first of size p, i.e. p expressions of the form u, and the second of size 
p(k - 1) comprising all the other orthogonal linear functions. This problem has been treated 
by Wilks (1935, 1943) and the relevant distribution is expressible as an incomplete β-function 
when p = 1 and 2 (for all k); the exact distribution is also known when p = 3 and 4 for k = 2. 

Finally, since when p = 1 the criterion has the form of a multiple correlation coefficient, the 
power of the test in this instance can be calculated by virtue of the work of Fisher (1928). 


Discussion of the test 

A sample of n observations is drawn from each of the k p-variate normal distributions of 
which the lth has the covariance matrix $(V^l_{ij})$ ($l = 1, 2, \ldots, k$; $i, j = 1, 2, \ldots, p$). It is required 
to test the hypothesis that 

$$V^1_{ij} = V^2_{ij} = \ldots = V^k_{ij}. \qquad (1)$$

The population means do not enter into the hypothesis and have arbitrary unknown values. 
Where i, j, l, m appear henceforth they will be understood to range over the values given 
above unless otherwise stated. The observations may be written in the form of an n × kp 
matrix X such that all those on the ith variate in the lth distribution are in column $(i - 1)k + l$. 
The αth observation in this column ($\alpha = 1, 2, \ldots, n$) is denoted by $x^l_{i\alpha}$; the order of the elements 
in a column is assumed to be random. If this is doubted the observations should be randomly 
rearranged. 

We must emphasize here that the sample value of the criterion to be used to test (1) 
depends on this order, and there is thus, in a sense, a correspondence between $x^l_{i\alpha}$ and $x^m_{j\alpha}$, 
although these two quantities are, of course, uncorrelated when $l \ne m$. Most tests of a 
hypothesis specifying nothing about the order in which observations are made or written 
down are themselves independent of it; ours is not, and different computers with the same 
data might well come to different conclusions, although this does not affect the validity of 
the test, the significance level being overall what it should be. There is probably some loss 
of power which can, however, be offset by imbuing α with a certain physical meaning; but 
we shall not discuss this question here. A criterion for testing normality depending on the 
order of arrangement of observations has been suggested by R. C. Geary (1935, pp. 316-17). 



and let the corresponding n × kp matrix be Z. If G = Z'Z, where a prime is used to denote 
the transpose of a matrix, then, apart from a factor n, G is the matrix of sample variances 
and covariances of all variables. We further define S(k,p) as the sum of all $(k^p)^2$ signed minors 
formed by rows and columns ... of G, where 

(3) 

$\bar S(k,p)$ is similarly defined for the matrix $\bar G = G^{-1}$ (we shall use this notation for the inverses 
of matrices throughout). 

We now proceed to prove the following 

Theorem: $W(k,p) = k^{2p}/S(k,p)\,\bar S(k,p)$ 

is distributed like Wilks's statistic for testing the hypothesis that two groups of variates 
of sizes p and p(k - 1), known to have been drawn from a (kp)-variate normal distribution, 
are mutually independent (Wilks, 1935, 1943). If the groups are in fact mutually independent 
then (1) is true. 

Proof. Introduce a k × k orthogonal matrix $B = (b_{lm})$, the elements of whose first column are all 
equal (to $\pm 1/\sqrt{k}$) but which is otherwise quite arbitrary. Put 

$$r = (i - 1)k + l, \qquad u = (m - 1)p + j, \qquad (4)$$

and form a kp × kp matrix A such that 

$$A_{ru} = \delta_{ij}\,b_{lm}, \qquad (5)$$

where $\delta_{ij} = 1$ ($i = j$), otherwise 0. Clearly A is also orthogonal. For example, suppose 
k = 4, p = 2. Apart from a factor of $\pm\tfrac{1}{2}$ multiplying each element, let 

        1   1   1   1 
B  =    1   1  -1  -1 
        1  -1   1  -1 
        1  -1  -1   1 

Then 

        1   0   1   0   1   0   1   0 
        1   0   1   0  -1   0  -1   0 
        1   0  -1   0   1   0  -1   0 
A  =    1   0  -1   0  -1   0   1   0 
        0   1   0   1   0   1   0   1 
        0   1   0   1   0  -1   0  -1 
        0   1   0  -1   0   1   0  -1 
        0   1   0  -1   0  -1   0   1 
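
The construction of A from B can be checked mechanically. The sketch below assumes the index rule of (4) together with the reading $A_{ru} = \delta_{ij} b_{lm}$, which reproduces the k = 4, p = 2 example; the function names are illustrative and the common factor $\tfrac{1}{2}$ is left out, so that the rows are orthogonal with squared length 4.

```python
def build_A(B, p):
    """Form the kp x kp matrix with A[r][u] = delta_ij * b_lm.

    Uses r = (i - 1) k + l and u = (m - 1) p + j with 1-based indices.
    """
    k = len(B)
    size = k * p
    A = [[0] * size for _ in range(size)]
    for i in range(1, p + 1):
        for l in range(1, k + 1):
            r = (i - 1) * k + l
            for m in range(1, k + 1):
                for j in range(1, p + 1):
                    u = (m - 1) * p + j
                    if i == j:
                        A[r - 1][u - 1] = B[l - 1][m - 1]
    return A


def gram(M):
    """Return M M' (rows against rows)."""
    return [[sum(a * b for a, b in zip(row_a, row_b)) for row_b in M]
            for row_a in M]


B = [[1, 1, 1, 1],
     [1, 1, -1, -1],
     [1, -1, 1, -1],
     [1, -1, -1, 1]]
A = build_A(B, 2)
```

Here `gram(A)` is four times the 8 × 8 unit matrix, so A becomes orthogonal once the factor $\tfrac{1}{2}$ is restored; with `p = 1` the function returns B itself.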

When p = 1, A = B. Let 

$$D = XA, \qquad Y = ZA, \qquad C = Y'Y = A'GA. \qquad (6)$$

Putting 

$$s = (j - 1)k + m, \qquad t = (l - 1)p + i, \qquad (7)$$

and defining 

$$t' = (l - 1)p + i, \qquad u' = (m - 1)p + j \quad (l, m = 2, 3, \ldots, k), \qquad (8)$$
we have 

A’igrs) = 


(9) 

so that 



(10) 

Hence when 

<^(cw) = 0 


(11) 

equations (1) 

are satisfied, because for fixed i and j equations (10) 

can be solved and yield 


{n-l)V[j = S’iCij). 


(12) 


Denote a typical element of the tth column of D by $d_t$. Then equations (11) are satisfied if 
and only if $d_i$ and $d_{t'}$ are mutually independent. 

A criterion for testing (11), obtained by likelihood-ratio methods, has been given by Wilks 
(1935, 1943). This is 


W{k,p) 



(13) 


and is sometimes called the vector alienation coefficient. Let $G^{(p)}$ be the pth compound of G 
(Aitken, 1939, p. 90), i.e. the matrix of all p × p minors of G; and $\bar G^{(p)}$ the pth compound of 
$\bar G$ (since $\bar G^{(p)}$ is the inverse of $G^{(p)}$ our notation is consistent). Then 

W{h,p) - (14) 

by an application of Jacobi’s theorem on the minors of the adjugate (Aitken, 1939, p. 97). 
Now by the Binet-Cauchy theorem (Aitken, 1939, p. 93), 


Consider the elements in the first row of $(A')^{(p)}$. The first p rows of $A'$ are of the form 

    11...1   00...0   00...0   ...   00...0 
    00...0   11...1   00...0   ...   00...0 
    00...0   00...0   11...1   ...   00...0 
    00...0   00...0   00...0   ...   11...1 

apart from the factor $\pm 1/\sqrt{k}$ multiplying each element. Therefore the only non-zero elements 
in the first row of $(A')^{(p)}$ are those formed by taking one column from each of the p blocks 
of k columns into which the first p rows of $A'$ may be divided. All the non-zero elements 
equal $\pm k^{-p/2}$. Then from (15) 

$$c^{(p)}_{11} = S(k,p)\,k^{-p}, \qquad \bar c^{(p)}_{11} = \bar S(k,p)\,k^{-p}, \qquad (16)$$

so finally 

$$W(k,p) = k^{2p}/S(k,p)\,\bar S(k,p). \qquad (17)$$

This completes the proof. 

Case of p = 1 

Here $W(k,1) = k^2/S(k,1)\,\bar S(k,1)$, 

where $S(k,1)$, $\bar S(k,1)$ are the sums of all elements of G, $\bar G$ respectively. If (1) is true, 
$\mathcal{W}(k,1)$, the true value of $W(k,1)$, is unity. Define 

$$W(k,1) = 1 - R^2 \quad\text{and}\quad \mathcal{W}(k,1) = 1 - \rho^2, \qquad (18)$$

so that if (1) is true, $\rho = 0$. The distribution of $R^2 = 1 - W(k,1)$ when $\rho = 0$ is, as Wilks 
pointed out, well known, being that of the multiple correlation coefficient (of $d_1$ on $d_2, \ldots, d_k$); 

if in the usual notation 

$$I_x(a, b) = [B(a, b)]^{-1} \int_0^x x^{a-1}(1 - x)^{b-1}\,dx, \qquad (19)$$

then the cumulative distribution function of $x = R^2$ is $I_x(\tfrac{1}{2}(k-1), \tfrac{1}{2}(n-k))$, values near 1 being 
significant; that of $x = W(k,1)$ being $I_x(\tfrac{1}{2}(n-k), \tfrac{1}{2}(k-1))$ with small values significant. Tables 
in convenient form have been calculated by Thompson (1941); otherwise we can convert 
to the variance-ratio F by 

$$F = (n - k)(1 - W)/(k - 1)W. \qquad (20)$$
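
For p = 1 the criterion needs only the sum of the elements of G and the sum of the elements of its inverse. A minimal sketch, assuming G is supplied as a non-singular k × k list of lists; the function names and the Gauss-Jordan inversion are illustrative choices and not part of the paper.

```python
def invert(matrix):
    """Invert a square matrix by Gauss-Jordan elimination with partial pivoting."""
    n = len(matrix)
    aug = [[float(v) for v in row] + [1.0 if i == j else 0.0 for j in range(n)]
           for i, row in enumerate(matrix)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("matrix is singular")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        scale = aug[col][col]
        aug[col] = [v / scale for v in aug[col]]
        for r in range(n):
            if r != col:
                factor = aug[r][col]
                aug[r] = [a - factor * b for a, b in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]


def w_statistic(G):
    """W(k, 1) = k^2 / (S * S_bar): S sums G, S_bar sums the inverse of G."""
    k = len(G)
    S = sum(sum(row) for row in G)
    S_bar = sum(sum(row) for row in invert(G))
    return k * k / (S * S_bar)


def variance_ratio(W, n, k):
    """Convert W to the variance-ratio F of (20)."""
    return (n - k) * (1.0 - W) / ((k - 1) * W)
```

With `G = [[1, 0], [0, 4]]` the sums are 5 and 1.25, so `w_statistic` returns 0.64; a matrix proportional to the unit matrix gives W = 1 and hence F = 0.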
It is clear that n must exceed k; for p variates, n exceeds pk in order that G may be 
non-singular. 

If the matrix A is defined instead as a kp × kp orthogonal matrix, the elements of whose 
first column are all equal (cf. equation (5)), the problem is effectively reduced to the case 
p = 1 whatever the value of p, and we can test exactly the somewhat indefinite hypotheses 


yii+v\z+--+y\p = yfi+vj^2+-+y^- 

(21) 

This may be applied in the following manner, for take = 2 and obtain 



yii + ^^12 = yii+ FL = VI + Ffa = Fii -1- Fla. 

(22) 

Thus 

yii = yh and F|i = Fi2. 

(23) 

If it is assumed 

Fil = F?1, 

(24) 

then 

FL = Fla, 

(26) 

and conversely. 

Case of p = 2

The distribution of $W(k, 2)$ has been given by Wilks (1936). If $x = \sqrt{W(k, 2)}$, the cumulative
distribution function of $x$ is

    $I_x(n - 2k, 2k - 2)$.    (26)

Small values of $x$ are significant and n must exceed 2k.

Case of p ≥ 3

For k = 2, p = 3 and 4, the exact distributions are again known and have been given by
Wilks in equations (35) and (37) respectively of his 1936 paper. The expressions are rather
complicated and we have not reproduced them here. For other values of k and p the moments
of W(k, p) are available; while more recently Wald & Brookner (1941) have obtained the
distribution in the form of an infinite series, calculating numerical values for the coefficients
in certain instances.

For p > 1, (17) becomes rather intractable as a means of calculating W(k, p). Indeed,
for k = 2 and p = 4 it is necessary

(i) to calculate 36 sample variances and covariances,

(ii) find the inverse of an 8 × 8 matrix,

(iii) calculate 512 4 × 4 determinants,

and it is clearly better to reintroduce the matrix A in some appropriate numerical form,
calculate Y = ZA and C = Y'Y, and find W(2, 4) from (13), a process which involves the
evaluation of an 8 × 8 and two 4 × 4 determinants.


Power of the test when p = 1

From (17) the true value of $W(k, 1)$ is in general given by

    $1 - R^2 = k^2 \big/ \left[\left(\sum_t V_{11}^{(t)}\right)\left(\sum_t 1/V_{11}^{(t)}\right)\right]$,    (27)

and thus the test will have equal power for all values of the variances such that the product
of their sum and the sum of their reciprocals is constant. Consequently $1 - W(k, 1)$ is
distributed like the multiple-correlation coefficient in samples from a population where the
true value is given by (27). The probability density function of this distribution has been
deduced by Fisher (1928) and can be integrated to give a finite series when (n − k) is even.
We find easily when k = 2 that in the $(V_{11}^{(1)}, V_{11}^{(2)})$ quarter-plane the equipotentials are pairs
of lines

    $V_{11}^{(1)} = a V_{11}^{(2)}$ and $a V_{11}^{(1)} = V_{11}^{(2)}$,    (28)

where    $a = (1 + R)/(1 - R)$.    (29)

For k > 2 the equipotential surfaces in k dimensions are cones through the origin situated
symmetrically with regard to the co-ordinate primes.
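
The dependence of the power on a single parameter can be made concrete with a short sketch. The Python fragment below evaluates the true value of $R^2$ from equation (27) for a given set of population variances, and the ratio a of equation (29); the function names `true_r2` and `ratio_a` are ours, introduced only for illustration.

```python
def true_r2(variances):
    """True value of R^2 from equation (27): 1 - k^2 / (sum V * sum 1/V)."""
    k = len(variances)
    total = sum(variances)
    total_reciprocal = sum(1.0 / v for v in variances)
    return 1.0 - k * k / (total * total_reciprocal)


def ratio_a(r):
    """Ratio a of equation (29); for k = 2 the variances are as a : 1."""
    return (1.0 + r) / (1.0 - r)


# Standard deviations 10, 30, 10, 40, as in the worked example of the paper.
r2_example = true_r2([10**2, 30**2, 10**2, 40**2])

# For k = 2 the pairs (a, 1) and (1, a) lie on the same equipotential (28).
r = 0.5
a = ratio_a(r)
r2_pair = true_r2([a, 1.0])
r2_swapped = true_r2([1.0, a])
```

With R = 0.5 the ratio a is 3, and both orderings of the two variances return $R^2 = 0.25$, which is the symmetry expressed by the pair of lines in (28).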

Reverting to k = 2, three methods are available for testing the hypothesis that $V_{11}^{(1)} = V_{11}^{(2)}$:

(i) Fisher's z or    $F = \exp(2z) = g_{11}/g_{22}$;    (30)

(ii) the $L_1$ criterion introduced by Pearson & Neyman (1930) and later extended to
k > 2 (Neyman & Pearson, 1931).

In the instance we are considering, i.e. equal sample sizes from both populations,

    $L_1 = 2(g_{11} g_{22})^{1/2} / (g_{11} + g_{22})$    (31)
        $= 2F^{1/2} / (1 + F)$.    (31a)

(iii)    $W(2, 1) = 4[g_{11} g_{22} - (g_{12})^2] / [(g_{11} + g_{22})^2 - (2g_{12})^2]$.*    (32)

Thus tests (i) and (ii) are exactly equivalent, as is known, the optimum critical region being
that corresponding to equal tails of the F-distribution. Criterion (iii) is that obtained by
Morgan (1939) and Pitman (1939), appearing as equation (12) in Morgan's paper, to test
that the variances in a normal bivariate population are equal. Morgan has compared the
powers of tests (i) and (iii) for n = 12, 26 and 100 at a significance level of 0.10 and for these
sample sizes it appears that the tests are effectively of equal power.

When n is large and, consequently, the two populations being independent, $(g_{12})^2 / g_{11} g_{22}$
is converging in probability to zero,

    $W(2, 1) \sim L_1^2$.    (33)


The cumulative distribution functions of criteria (ii) and (iii) are respectively
$I_x(\tfrac12(n - 1), \tfrac12)$ (Nayer, 1936) ($x = L_1^2$) and $I_x(\tfrac12(n - 2), \tfrac12)$ ($x = W(2, 1)$). Generally, $W(k, 1)$ for large
n converges in probability to the harmonic mean of the sample variances divided by their
arithmetic mean; $L_1$ (for equal sample sizes) is exactly equal to the geometric mean divided by
the arithmetic mean.
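
A brief numerical sketch may help to show how criteria (ii) and (iii) are related when k = 2. The Python functions below implement equations (31) and (32) as we read them; the function names and the input values are hypothetical and serve only as an illustration.

```python
import math


def l1_two_samples(g11, g22):
    """L1 of equation (31): geometric over arithmetic mean of g11, g22."""
    return 2.0 * math.sqrt(g11 * g22) / (g11 + g22)


def w_two_samples(g11, g22, g12):
    """W(2, 1) of equation (32)."""
    numerator = 4.0 * (g11 * g22 - g12**2)
    denominator = (g11 + g22) ** 2 - (2.0 * g12) ** 2
    return numerator / denominator


# Hypothetical sums of squares and products, chosen only for illustration.
g11, g22 = 4.0, 1.0
l1 = l1_two_samples(g11, g22)
w_uncorrelated = w_two_samples(g11, g22, 0.0)  # equals l1 squared, cf. (33)
w_correlated = w_two_samples(g11, g22, 1.0)
```

When the sample covariance vanishes, W(2, 1) coincides with $L_1^2$; a non-zero covariance alters W(2, 1) while leaving $L_1$ unchanged.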


Example of the use of the test for a case with k = 4, p = 1

It is not easy to calculate W(k, 1) from equation (17) if k > 3. Indeed, the main value of (17)
lies in showing the form of solution, and in establishing that this is independent of the
particular orthogonal transformations used. In the following example, therefore, orthogonal
transformations are made at once and the multiple correlation coefficient is calculated from
the numerical data. This procedure is far quicker than that involved in calculating W(4, 1)
from (17).


* See Appendix. 
Below are given samples of 10 from each of four univariate normal populations:

     x1    x2    x3    x4   |   x1    x2    x3    x4
    -20   +24    +4   +52   |   +7   +15    +8    -8
     -1   +18    +9   -24   |   +5   +24    -1   +56
    -11   +27   -27     0   |  +18   -12    +1   -64
    +10   +21    +5   +48   |  +13   -24    -4   +12
     -4   -48    -3   +48   |   -6   +12    +5   -12


which have mean zero and standard deviations respectively 10, 30, 10, 40. Make the following
orthogonal transformation:

    $y_1 = x_1 + x_2 + x_3 + x_4$,    $y_2 = x_1 + x_2 - x_3 - x_4$,
    $y_3 = x_1 - x_2 + x_3 - x_4$,    $y_4 = x_1 - x_2 - x_3 + x_4$,

and obtain

     y1    y2    y3    y4   |   y1    y2    y3    y4
    +60   -52   -92    +4   |  +22   +22    +8   -24
     +2   +32   +14   -52   |  +84   -26   -76   +38
    -11   +43   -65   -11   |  -57   +69   +95   -57
    +84   -22   -54   +32   |   -3   -19   +21   +53
     -7   -97    -7   +95   |   -1   +13    -1   -35

Form the matrix of sums of squares and cross-products, i.e. $G$. This is

    +18636.1    -9646.9   -18232.9    +7325.1
     -9646.9   +21784.1   +12018.1   -19015.9
    -18232.9   +12018.1   +28692.1    -9445.9
     +7325.1   -19015.9    -9445.9   +22008.1

The matrix of sample correlation coefficients is therefore

     1         -0.4788    -0.7885    +0.3617
    -0.4788     1         +0.4807    -0.8685
    -0.7885    +0.4807     1         -0.3759
    +0.3617    -0.8685    -0.3759     1


Hence the multiple correlation coefficient of $y_1$ on ($y_2$, $y_3$, $y_4$) is given by $\hat{R}^2 = 0.637$. Calculated
by the approximation indicated in the last paragraph of the preceding section,* $\hat{R}^2 \approx 0.656$;
the true value obtained from equation (27) with variances in the ratio $1^2 : 3^2 : 1^2 : 4^2$ is 0.727.
The upper 10 and 5 % levels of significance, obtained from Thompson's tables with
$\nu_1 = n - k = 6$, $\nu_2 = k - 1 = 3$, are respectively 0.622 and 0.704. We find $L_1 = 0.565$, the
5 and 1 % levels obtained from Nayer's (1936) tables being respectively 0.797 and 0.719,
so that this test gives a more significant result than the one based on $\hat{R}^2$. The relative merits
of $L_1$ and the test we have provided, which cannot be judged on the results of one example,
remain a problem to be investigated.


* I.e. calculated from 1 − (harmonic mean of $g_{ii}$)/(arithmetic mean of $g_{ii}$).
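
The arithmetic of this example can be retraced with a short Python sketch, given here as an illustration rather than as the authors' own computing scheme. It takes the correlation matrix as printed above and obtains the squared multiple correlation of $y_1$ on the other three variates by solving the normal equations; it then recomputes the harmonic-mean approximation and $L_1$ from the sums of squares of the four samples. The sample values are our reading of the data table, and the helper names (`solve`, `multiple_r2`, `sum_sq_dev`) are ours.

```python
import math

# Sample correlation matrix of y1..y4 as printed above.
CORR = [
    [1.0, -0.4788, -0.7885, 0.3617],
    [-0.4788, 1.0, 0.4807, -0.8685],
    [-0.7885, 0.4807, 1.0, -0.3759],
    [0.3617, -0.8685, -0.3759, 1.0],
]


def solve(matrix, rhs):
    """Solve matrix * b = rhs by Gaussian elimination with partial pivoting."""
    n = len(rhs)
    aug = [row[:] + [rhs[i]] for i, row in enumerate(matrix)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    b = [0.0] * n
    for i in reversed(range(n)):
        tail = sum(aug[i][j] * b[j] for j in range(i + 1, n))
        b[i] = (aug[i][n] - tail) / aug[i][i]
    return b


def multiple_r2(corr):
    """Squared multiple correlation of the first variate on the others."""
    r = corr[0][1:]
    inner = [row[1:] for row in corr[1:]]
    beta = solve(inner, r)
    return sum(bi * ri for bi, ri in zip(beta, r))


# Our reading of the four samples of 10 (one list per population).
X = [
    [-20, 7, -1, 5, -11, 18, 10, 13, -4, -6],
    [24, 15, 18, 24, 27, -12, 21, -24, -48, 12],
    [4, 8, 9, -1, -27, 1, 5, -4, -3, 5],
    [52, -8, -24, 56, 0, -64, 48, 12, 48, -12],
]


def sum_sq_dev(values):
    """Sum of squares of deviations from the sample mean."""
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values)


g = [sum_sq_dev(sample) for sample in X]
arithmetic = sum(g) / len(g)
harmonic = len(g) / sum(1.0 / gi for gi in g)
geometric = math.exp(sum(math.log(gi) for gi in g) / len(g))

r2_exact = multiple_r2(CORR)             # compare with 0.637 in the text
r2_approx = 1.0 - harmonic / arithmetic  # compare with 0.656 in the text
l1 = geometric / arithmetic              # L1 for equal sample sizes
```

Because the sample sizes are equal, the divisor n − 1 cancels from every ratio of means, so sums of squares can be used in place of variances.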
Summary

An exact test has been put forward for the equality of variances and covariances in any
number k of 1- or 2-variate normal populations; the test is also exact for two 3- or 4-variate
populations; but is restricted in application to equal sample sizes n from the k populations
where n exceeds pk, p being the number of variates. The moments of the criterion are available
for k p-variate populations where the statistic used is equivalent to that employed by
Wilks (1935) to test the independence of two groups of variates (of sizes p and p(k − 1)), and
has the same distribution. In the univariate case the power of the test is known as a function
of one parameter. Comparison with the $L_1$ criterion has already been made when p = 1
and k = 2, the tests being practically the same, and an example worked out of the use of the
test when p = 1.

Our thanks are due to E. C. Fieller for drawing our attention to the papers by Morgan and
Pitman and suggesting that the test given there for the equality of two variances might be
extended to more than two; also to Prof. E. S. Pearson for pointing out the need of certain
explanatory additions.

The work described above has been carried out as part of the research programme of the
National Physical Laboratory, and this paper is published by permission of the Director
of the Laboratory.
THE ESTIMATION FROM INDIVIDUAL RECORDS OF THE
RELATIONSHIP BETWEEN DOSE AND QUANTAL RESPONSE

By D. J. FINNEY

Lecturer in the Design and Analysis of Scientific Experiment, University of Oxford

1. Introduction 

A type of biometric problem frequently encountered by the statistician is that which
requires the estimation and study of a relationship between dose and response. 'Dose' is
here a general term indicating the magnitude of a stimulus applied to certain test subjects,
and 'response' is a measure of the effect which the stimulus produces on the subjects. When
the test subjects are living matter, whether plants, animals or bacteria, pieces of tissue or
single cells, the response to a specified dose is unlikely to be constant in repeated trials, and
regression methods must be used in the estimation of the relationship.

In some classes of data, the response is 'all-or-nothing' or quantal, and cannot be measured
quantitatively. Ordinary regression methods are then no longer applicable; methods based
on the transformation of the proportion of subjects showing the response at any dose level
to the normal equivalent deviate (Gaddum, 1933), or to the probit (Bliss, 1934a, b), however,
have proved very powerful for simplifying the statistical analysis. In recent years, full
accounts of the underlying theory of these transformations, and of their application, have
been published by various authors (see, for example, Bliss, 1935a, b; Finney, 1947, 1948).
An additional difficulty sometimes found is that the intensity of the stimulus cannot be
selected in advance of a test, but can only be measured after the test has taken place; only
rarely will two or more subjects happen to receive exactly the same dose, and more usually
the records consist of a list of doses with, for each, a statement of whether a single subject
receiving that dose responded or not.* For example, in some methods for the testing of
insecticidal potency, poison bait is offered to individual insects; the dose received by any
insect cannot be specified in advance, and must instead be measured as the amount of poison
ingested.

Data from experiments of this kind do not give empirical values for the proportion of 
subjects responding at each dose level, except in the trivial sense that every dose shows either 
zero or 100 % responding. Nevertheless, as Bliss (1938) has pointed out, the probit method 
can still be applied to estimation of the dose-response relationship. He has given a numerical 
example, though without showing full details of the working, but has admitted that assessment
of the error of estimation presents some theoretical difficulties (Finney, 1947, §43).
An interesting example of experimental results requiring this type of analysis has recently
been brought to the notice of the writer by Mr R. W. Gilliatt. These introduce an additional
complication, since the dose is expressed in terms of two measurements, and a probit plane 
(Finney, 1943) or other bivariate regression function must therefore be estimated. An 
account of the analysis, with computational details, may help those who have encountered 
analogous problems in biological or other investigations. 

* When response does not involve death or serious alteration of the test subject, one subject may be 
used many times; the example discussed in this paper is an instance. The form of the data will be the same, 
though the interpretation may require that tolerance variation between and within subjects be
distinguished.

2. The data

Research in human physiology has demonstrated that, under carefully controlled experimental
conditions, a transient reflex vaso-constriction in the skin of the digits may follow
a single deep breath (Bolton, Carmichael & Stürup, 1936). Gilliatt (1947) has found that the
response depends in part on the volume of air taken in by the subject. Plethysmographic
measurement of the volume changes in a finger was used to indicate the occurrence of a
response, but assessment of the degree of vaso-constriction, in order to relate this to the
inspiratory stimulus, was not practicable. Thus the records obtained for each test show only
the volume of air inspired, the average rate of inspiration, and whether or not vaso-constriction
was produced. The above brief outline is sufficient for appreciation of the statistical
problem, but a full account of the experimental procedure may be found in Gilliatt's paper;
the results discussed here are presented in his Fig. 5.

Fig. 1. Contours of dose-response surface for 0.1, 0.25, 0.5, 0.75 and 0.9 frequency of response,
estimated from three-parameter equation. ○ no vaso-constriction; ● vaso-constriction.

The data, which Mr Gilliatt has kindly made available to the writer, were obtained from
thirty-nine tests, in twenty of which vaso-constriction occurred. Tests were made on three
different subjects, nine on D.W., eight on V.P.W., and twenty-two on S.J.S.; the results of
the tests, with the subjects in this order, are shown in Table 1. In Fig. 1 are shown the
thirty-nine combinations of volume in litres (V) and rate of inspiration in litres per second
(R), together with indications of whether or not the subject responded under these conditions.
Inspection of Fig. 1 shows that, in general, when both V and R were small no response
occurred, when either was large (unless the other was very small) the response occurred, and
in an intermediate region the proportion of responses increased as either V or R increased.
There was no sharply defined threshold separating combinations of V and R giving the
response from those giving no response; instead, there appeared to be a probability of response
ranging from practical certainty under some conditions to zero under others.

Table 1. Experimental data and details of calculations

As an aid to fuller understanding of the influence of breathing on vaso-constriction,
examination of the relationship between V, R, and the probability of response seemed desirable.
Since so few observations were available for each subject, the data were unlikely to be
sufficient to show differences between subjects; this point is discussed later, but in the main
analysis the distinction between subjects is ignored. For any form of response assessment,
the testing of one subject many times must introduce a danger that the result of one test
will be affected not only by its own stimulus but by preceding stimuli and by the effects they
produced. In this investigation, each subject was given a number of preliminary tests until
he appeared to have settled into the routine. The observations recorded in Table 1 were
obtained after these preliminary trials; they are tabulated in the order of testing, and show
no indication of effects of previous history, but clearly such effects would have to be very
pronounced if they were to be detectable on this amount of data.

3. Method of analysis

Preliminary examination of the data suggested that the occurrence of a response was largely
determined by the magnitude of VR, the product of volume and rate, curves on which the
probability of response has a constant value being approximately hyperbolae of the form

    $VR = \text{constant}$.    (1)

A little consideration shows that an equation of this type is more reasonable than an equation
linear in V and R, though the data are almost certainly inadequate for discriminating between
many alternative types of relationship that might be postulated. A system of curves similar
to, but rather more general than, equation (1), namely,

    $V^{\beta_1} R^{\beta_2} = \text{constant}$,    (2)

was selected for trial; this equation may alternatively be regarded as representing a series
of parallel linear relationships

    $\beta_1 \log V + \beta_2 \log R = \text{constant}$    (2a)

between the logarithms of volume and rate for a fixed probability of response.

A specified combination of V and R will not necessarily always give the same result
(response or no response) with a subject, for, even though the subject is unaltered, minor
uncontrolled variations in his environment may affect his susceptibility to the applied
stimulus. For a particular value of V, the threshold value of R (the value which under the
conditions prevailing at any instant would be just sufficient to produce a response) will
have a frequency distribution; similarly, for a particular R, there will be a frequency
distribution of threshold values of V. If these distributions may be taken as normal in log V
and log R, and, for simplicity, they are supposed to be such that the mean of either logarithm
is linearly related to the selected value of the other, then the probability of response will be
determined by an expression of the form

    $\beta_1 \log V + \beta_2 \log R$,
and the threshold values of this quantity will be normally distributed. If $x_1$ and $x_2$ are
written for log (10V) and log (10R) respectively (the factor of 10 is introduced in order to
make $x_1$ and $x_2$ always positive), this statement enables the probability of response, P, to
be expressed as

    $P = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{\alpha + \beta_1 x_1 + \beta_2 x_2} e^{-\frac12 u^2}\,du$,    (3)

where $\alpha$, $\beta_1$ and $\beta_2$ are parameters to be estimated from the data. The estimation may be
regarded as the fitting of a probit regression plane, for Y, the probit of P, is given by

    $Y = 5 + \alpha + \beta_1 x_1 + \beta_2 x_2$.    (4)

Substitution of the value of Y corresponding to a specified probability gives the required
linear relationship, equation (2a), between $x_1$ and $x_2$ for that probability, from which the
estimated curves of constant probability, equation (2), may easily be derived.
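
The relation between equations (3) and (4) can be sketched in a few lines of Python. The function below is only an illustration: its name and the parameter values are hypothetical, and common logarithms are assumed for $x_1$ and $x_2$.

```python
import math
from statistics import NormalDist


def response_probability(volume, rate, alpha, beta1, beta2):
    """P of equation (3) for volume V (litres) and rate R (litres/second)."""
    x1 = math.log10(10.0 * volume)
    x2 = math.log10(10.0 * rate)
    probit = 5.0 + alpha + beta1 * x1 + beta2 * x2  # Y of equation (4)
    return NormalDist().cdf(probit - 5.0)


# Hypothetical parameter values, for illustration only.
p_mid = response_probability(1.0, 1.0, alpha=-6.0, beta1=3.0, beta2=3.0)
p_high = response_probability(2.0, 2.0, alpha=-6.0, beta1=3.0, beta2=3.0)
```

With these illustrative values the probit is exactly 5 at V = R = 1, so the probability of response there is one-half, and it rises as either V or R is increased.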

The procedure for fitting a probit plane has been described elsewhere (Finney, 1943, 1947,
§31), and its chief features need no alteration for application to individual records. Providing
that a first approximation to the equation can be guessed, repeated cycles of computation
will give values for the parameters which approach more and more closely to the
maximum likelihood estimates. Care in the choice of the first approximation will reduce
the number of cycles needed; a poor choice will delay the convergence, though it will not
affect the ultimate result. Since only a single observation is available for each combination
of $x_1$ and $x_2$, every working probit is either a maximum or minimum value, according to
whether or not the response occurs. When there is only one dose factor, in the fitting of a
probit regression line to records of individuals, grouping of doses and treatment of the
observations in a group as if they related to an average dose may reduce the labour of the
early computing cycles, but, since it will tend to give an underestimate of the regression
coefficient, the final cycle may need to use the detailed records. Bliss (1938) has given an
example illustrating grouping of this kind. Grouping is less easily applied, however, when two
or more dose factors have to be used, and, for the data under discussion, the individual records
were used throughout except in the formation of the first approximation.

In the standard form of probit analysis, with moderately large numbers of observations
at each level of dose, a $\chi^2$ is usually computed for testing the significance of discrepancies
between the data and the fitted equation; this is numerically the same as would be obtained
by calculation from expected and observed numbers of responses and non-responses for
each dose. If there are few observations in any dose group, the expected number of responses
or of non-responses (or of both) is likely to be small, and, as is well known, $\chi^2$ may then fail
to follow the sampling distribution tabulated for that statistic. Data of the type under
discussion here are extreme examples of this situation, the number of observations for each
dose being reduced to unity, so that any disturbance of the $\chi^2$ distribution is likely to be
encountered in its most acute form. No complete theoretical investigation of this matter
has yet been made, but the practical implications are discussed more fully in § 5.

On the assumption that the estimate of equation (4) is an adequate representation of the
data, lines of constant response probability may be obtained for any specified probability;
these may be plotted according to equation (2) on a V, R scale. Standard statistical processes
also enable fiducial limits to be assigned to the position of any of these curves. The difficulty
of dealing with the estimation of error for individual records, and the inadequacy of the
data for any sensitive test of whether equation (2) is a satisfactory representation of the
system of curves, throw doubts on the exact interpretation of these fiducial limits. Nevertheless,
they give some idea of the confidence that can be attached to the estimated curves,
at least for moderate values of V and R; for extremes of either measurement, far more
extensive data would be needed before much faith could be placed in the fitted equation.

4. Computations for estimating the three-parameter equation

In this and the two succeeding sections, the computations for Gilliatt's data will be described
in detail. The first five columns of Table 1 show the thirty-nine pairs of values of V and R
which occurred in the experiments, followed by the corresponding values of $x_1$ and $x_2$,
together with a statement of whether or not the subject responded. Before the probit
computations could be initiated, a first approximation to equation (4) was needed; this was
obtained with the aid of the suggestion, from the plotting of the data shown in Fig. 1, that
the constant probability curves were approximately the hyperbolae of equation (1), or
alternatively

    $x_1 + x_2 = \text{constant}$.

As Bliss (1938) has pointed out, there is no objection to the use of overlapping groups in the
formation of the first approximation. The data were therefore grouped according to the
value of $(x_1 + x_2)$, as shown below, and the proportion of responses in each group was obtained
from Table 1:



    $(x_1 + x_2)$   Responses   Proportion (P)   Probit   First approximation
    1.5-1.9        0/7           0.00          —            3.3
    1.6-2.0        0/7           0.00          —            3.6
    1.7-2.1        2/7           0.29         4.4           3.9
    1.8-2.2        2/9           0.22         4.2           4.2
    1.9-2.3        3/14          0.21         4.2           4.5
    2.0-2.4        8/19          0.42         4.8           4.8
    2.1-2.5       13/24          0.54         5.1           5.1
    2.2-2.6       17/25          0.68         5.5           5.4
    2.3-2.7       16/17          0.94         6.6           5.7
    2.4-2.8       12/12          1.00          —            6.0


Each proportion was regarded as an estimate for the median value of $(x_1 + x_2)$ in the group,
i.e. 1.7, 1.8, 1.9, ..., and its probit was read from one of the standard tables (Finney, 1947,
Table I; Fisher & Yates, 1947, Table IX). As may be seen above, these probits were fairly
well fitted by the guessed equation

    $Y = -1.8 + 3x_1 + 3x_2$,    (5)

which was therefore used as a first approximation to equation (4).
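
How closely the guessed equation follows the grouped data may be checked with a short sketch. The Python fragment below converts each grouped proportion to a probit by means of the inverse normal integral (in place of the printed tables) and sets it beside the tabulated first approximation, which rises by 0.3 per step from 3.3 at $(x_1 + x_2) = 1.7$ to 6.0 at 2.6. The function names and the pairing of groups with medians are ours.

```python
from statistics import NormalDist


def probit(p):
    """Probit of a proportion p (0 < p < 1): normal deviate plus 5."""
    return 5.0 + NormalDist().inv_cdf(p)


def first_approximation(x_sum):
    """Straight line through the tabulated first approximations."""
    return 3.3 + 3.0 * (x_sum - 1.7)


# (median of x1 + x2, responses, tests) for groups with 0 < proportion < 1.
groups = [(1.9, 2, 7), (2.0, 2, 9), (2.1, 3, 14), (2.2, 8, 19),
          (2.3, 13, 24), (2.4, 17, 25), (2.5, 16, 17)]

for x_sum, responses, tests in groups:
    empirical = probit(responses / tests)
    print(f"{x_sum:.1f}  {empirical:.1f}  {first_approximation(x_sum):.1f}")
```

Groups in which none or all of the subjects responded have no finite probit and are therefore omitted from the comparison.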

A first set of expected probits was calculated from equation (5), and inserted as Y in an
earlier version of Table 1. A cycle of routine probit calculations, just as described in the next
two paragraphs, then led to an improved approximation to the required estimate, on which
a second cycle of improvement was based. The figures shown in Table 1 relate to the fourth
of these cycles, based upon the approximation

    $Y = -9.127 + 6.666x_1 + 5.906x_2$    (6)

from the third cycle. Equation (6) is very different from equation (5), suggesting that more
care might have been given to the selection of a first approximation; that the grouping
adopted would lead to underestimation of the regression coefficients was expected, but
insufficient allowance for this was made. Of course the 'improvement' in the approximations
refers to their approach to the solution of the maximum likelihood equations, and is not
necessarily always an approach to the true relationship.

The column of expected probits, Y, in Table 1 was calculated by substitution of pairs of
values $x_1$, $x_2$ in equation (6); one decimal place here is quite sufficient. The weighting
coefficient, w, for each observation was then read from tables (Finney, 1947, Table II; Fisher &
Yates, 1947, Table XI) and entered in its column. The working probit, y, takes a maximum
value for every observation giving a response and a minimum value for every observation
giving no response, since these give empirical rates of 100 % and zero respectively; values
of y were read directly from Finney's table (1947, Table III; or, less simply for the minimum
values, from Fisher & Yates, 1947, Table XI). The numbers of decimal places shown for the
entries in Table 1 are sufficient for data of this type; indeed possibly one decimal for w and
for y would be enough. Columns $wx_1$, $wx_2$ and $wy$ were then filled, and the weighted sums of
squares and products of deviations, required for the calculation of the regression of y on
$x_1$ and $x_2$, were completed at the bottom of the table.
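
For readers without the printed tables, the quantities read from them can be approximated directly. The sketch below assumes the usual formulae of probit analysis, which are not written out in this paper: weighting coefficient $w = Z^2/PQ$, maximum working probit $Y + Q/Z$ and minimum working probit $Y - P/Z$, where Z is the ordinate of the normal curve at the expected probit Y. The function name is ours.

```python
from statistics import NormalDist

_STD_NORMAL = NormalDist()


def probit_quantities(expected_probit):
    """Return (w, y_max, y_min) for one observation with expected probit Y."""
    deviate = expected_probit - 5.0
    p = _STD_NORMAL.cdf(deviate)     # expected probability of response
    q = 1.0 - p
    z = _STD_NORMAL.pdf(deviate)     # ordinate of the normal curve
    weight = z * z / (p * q)         # weighting coefficient w (assumed form)
    y_max = expected_probit + q / z  # working probit when a response occurs
    y_min = expected_probit - p / z  # working probit when none occurs
    return weight, y_max, y_min


w, y_max, y_min = probit_quantities(5.0)
```

An observation showing a response takes `y_max` and one showing no response takes `y_min`; the products $wx_1$, $wx_2$ and $wy$ then follow by multiplication.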

The equations giving the estimates of the regression coefficients, $b_1$ and $b_2$, are

    $0.494528\,b_1 - 0.382729\,b_2 = 1.032130$,
    $-0.382729\,b_1 + 0.517714\,b_2 = 0.516978$.

Later calculations use the variances and covariance of $b_1$ and $b_2$; the equations were therefore
solved by first obtaining the matrix inverse to that formed by the coefficients of $b_1$ and $b_2$
(Finney, 1943, 1947, §31; Fisher, 1946, §29). This matrix is

    $\begin{pmatrix} v_{11} & v_{12} \\ v_{12} & v_{22} \end{pmatrix} = \begin{pmatrix} 4.726144 & 3.493883 \\ 3.493883 & 4.514482 \end{pmatrix}$;    (7)

the accuracy of the data is insufficient to need the number of decimal figures shown here,
but their retention assists the checking and maintains the internal consistency of the
analysis. Now

    $b_1 = 1.032130\,v_{11} + 0.516978\,v_{12} = 6.68426$,

and similarly    $b_2 = 5.94003$.

The estimate of equation (4) is then

    $Y = \bar{y} + b_1(x_1 - \bar{x}_1) + b_2(x_2 - \bar{x}_2)$,

or    $Y = -9.182 + 6.6843x_1 + 5.9400x_2$,    (8)

a result which differs little from equation (6) and may be regarded as a sufficiently close
approximation to the maximum likelihood estimate. Since

    $b_2/b_1 = 0.889$,    (9)

equation (8) may be transformed to give

    $VR^{0.889} = \text{constant}$    (10)

as the relationship estimated to exist between V and R for a specified probability; the value
of the constant can be obtained by substitution of the probit of the probability in equation
(8), a process which gives 1.10, 1.36, 1.71, 2.16 and 2.66 for probabilities of 10, 25, 50, 75
and 90 % respectively. Typical contours have been drawn in Fig. 1 so as to indicate the form
of the relationship.
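
The arithmetic of this section is easily retraced. The following Python sketch, offered as an illustration rather than as the original computing scheme, inverts the 2 × 2 matrix of coefficients, recovers $b_1$ and $b_2$, and then finds the constant of equation (10) for a given probability by using $x_1 = 1 + \log V$ and $x_2 = 1 + \log R$ in equation (8). The intercept −9.182 is taken from equation (8), common logarithms are assumed, and the function name `contour_constant` is ours.

```python
from statistics import NormalDist

# Normal equations for b1 and b2 as given in the text.
A11, A12, A22 = 0.494528, -0.382729, 0.517714
RHS1, RHS2 = 1.032130, 0.516978

det = A11 * A22 - A12 * A12
v11, v12, v22 = A22 / det, -A12 / det, A11 / det  # inverse matrix, cf. (7)
b1 = RHS1 * v11 + RHS2 * v12
b2 = RHS1 * v12 + RHS2 * v22


def contour_constant(p, intercept=-9.182):
    """Value of V * R**(b2/b1) on the contour of response probability p."""
    y = 5.0 + NormalDist().inv_cdf(p)
    # Y = intercept + b1 (1 + log V) + b2 (1 + log R); solve for
    # log V + (b2 / b1) log R.
    log_c = (y - intercept - b1 - b2) / b1
    return 10.0 ** log_c


constants = [contour_constant(p) for p in (0.10, 0.25, 0.50, 0.75, 0.90)]
print([round(c, 2) for c in constants])
```

Each contour of Fig. 1 is then the curve $VR^{0.889}$ equal to the corresponding constant.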
5. Goodness of fit

When probit analysis is applied to data containing many observations in each dose group,
the weighted sum of squares of deviations between the empirical probits and the predictions
from the fitted equations is a $\chi^2$ with degrees of freedom equal to the number of dose groups
reduced by the number of fitted parameters. If $S_{uv}$ is written for the weighted sum of products
of deviations of variates u and v, application of this method here would give

    $\chi^2_{[36]} = S_{yy} - b_1 S_{x_1 y} - b_2 S_{x_2 y}$
              $= 40.045 - 6.6843 \times 1.0321 - 5.9400 \times 0.5170$
              $= 30.08$.    (11)
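
The evaluation in equation (11) amounts to subtracting from the weighted sum of squares of y the part accounted for by the regression. As a small sketch (the function name is ours; the figures are those quoted in the equation):

```python
def residual_chi_square(s_yy, slopes, s_xy):
    """Residual weighted sum of squares S_yy - sum(b_i * S_{x_i y})."""
    return s_yy - sum(b * s for b, s in zip(slopes, s_xy))


chi_square = residual_chi_square(
    40.045, slopes=(6.6843, 5.9400), s_xy=(1.0321, 0.5170)
)
degrees_of_freedom = 39 - 3  # thirty-nine observations, three parameters
print(round(chi_square, 2), degrees_of_freedom)
```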

When the dose groups are small, however, the $\chi^2$ so calculated cannot be trusted as an
indicator of the significance of deviations from the fitted equation, and it is presumably most
unreliable when each group is reduced to a single observation. Apart from slight discrepancies
caused by imperfect approximation to the maximum likelihood solution, the $\chi^2$ in equation
(11) is algebraically identical with that which would be derived, by the usual form of
calculations, from comparison of observed numbers responding and not responding in each
group with expectations computed from the fitted equation. As is well known from the
study of contingency tables, when the expectations in some classes are small the sampling
distribution of such a $\chi^2$ may be very different from that shown in the standard tables
(Finney, 1947, Table VI; Fisher & Yates, 1947, Table IV); with data from individual records,
no class can have an expectation greater than unity, and for many the expectation will be
very much less, so that the discrepancy from the tabulated $\chi^2$ distribution is likely to be
serious.

The general effect of small expectations on the random sampling distribution of $\chi^2$ appears
to be that the mean value remains about equal to the number of degrees of freedom, but that
the variance in repeated sampling is increased. Consequently, samples from a population
according with the null hypothesis are likely to show an excess of very high and very low
values, as judged by the tables of $\chi^2$. Thus there is little danger that significant evidence of
deviations from expectation will be overlooked in an uncritical application of the test, though
apparently significant values of $\chi^2$ need to be examined with care before they are regarded
as evidence sufficient to justify rejection of the null hypothesis. Low values, as in Gilliatt's
data ($\chi^2$ = 30 with 36 degrees of freedom), need cause little alarm, for they clearly indicate no
serious deviation from expectation. High values may in the first instance be compared
with the standard tables of the $\chi^2$ distribution; if they fall beyond the significance level, a
closer examination should be made before judging the null hypothesis to be untenable, for
the apparent significance may be due to large contributions from one or two aberrant points.
Gilliatt's data provide an illustration of this. The expected probits for each pair of values
of $x_1$ and $x_2$ in Table 1 have been calculated from equation (8), and the probabilities,
P (= 1 − Q), corresponding to these have been entered in the last column of the table;
P is then the expectation of the number of responses for each dose. The $\chi^2$ obtained from the
observed and expected numbers in seventy-eight classes is easily seen to be the sum of Q/P
for all doses giving a response, plus P/Q for all giving no response. Inspection of the column
for P shows small contributions to $\chi^2$ everywhere, except for two instances of responses with
probabilities of only 0.098 and 0.128, contributing 9 and 7 respectively; clearly the occurrence
of these two responses as the most extreme events in thirty-nine trials need not be regarded
as serious evidence against the null hypothesis. The result of calculating $\chi^2$ by this more
laborious process is a total of 30.3, which agrees closely with that already given in
equation (11).
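
The reduction to Q/P and P/Q follows because an observed response contributes $(1 - P)^2/P + (0 - Q)^2/Q = Q/P$, and similarly for a non-response. A minimal Python sketch of the summation is given below; the function name is ours, and apart from the two aberrant responses quoted in the text the records are hypothetical.

```python
def chi_square_from_records(records):
    """Sum Q/P over responses and P/Q over non-responses.

    `records` is an iterable of (p, responded) pairs, where p is the
    fitted probability of response for that observation.
    """
    total = 0.0
    for p, responded in records:
        q = 1.0 - p
        total += q / p if responded else p / q
    return total


# The two aberrant responses mentioned in the text.
aberrant = [(0.098, True), (0.128, True)]
contributions = [chi_square_from_records([rec]) for rec in aberrant]

# Two further records, hypothetical and for illustration only.
example_total = chi_square_from_records(aberrant + [(0.9, True), (0.2, False)])
```

The two aberrant responses contribute roughly 9 and 7, as stated, while a response at a fitted probability of 0.9 adds only about 0.1.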

One method of modifying a $\chi^2$ test so as to remove its extreme sensitivity to deviations
from small expectations is to combine expected and observed frequencies over several
adjacent groups, so as to obtain groups with larger expectations; the number of degrees of
freedom is then taken as the number of remaining groups less the number of fitted
parameters. Of course the groups must be chosen objectively, and without regard to the
agreement between the frequencies. The statistic still will not follow the $\chi^2$ distribution exactly,
but the approximation should be fairly satisfactory under the usual restriction that the
groups be so chosen that none of the expected frequencies is small. This procedure often has
to be adopted in probit analysis because of small expectations at very low or very high doses
(Finney, 1947, §18). With individual records, however, only very extensive grouping will
give expectations sufficiently large for the $\chi^2$ test to be trusted; the reduction of a large $\chi^2$
to a value below the significance level might then appear indicative of an insensitive test
rather than of absence of serious discrepancies.*

Probably no completely satisfactory solution of the difficulty is to be expected. Individual 
records usually arise from experimental work in which the obtaining of large numbers of 
observations presents considerable difficulty. Often the whole series will consist of less than 
fifty observations, and, unless previous information enables the range of doses to be chosen 
satisfactorily, many of the observations will be made at doses for which response is either 
almost certain or almost impossible. Even if the individual dose-tolerances could be measured
directly, a test of normality of their distribution (which is what the $\chi^2$ test attempts to
provide) could not be very sensitive when based on only fifty measurements; if, instead,
only quantal data are available, indicating merely whether a dose is below or above the 
tolerance value, a sensitive normality test is still less likely to exist (Finney, 1947, § 43). 

Gilliatt’s data, a series of only thirty-nine observations, provide an extreme instance of
the difficulty of formulating a sensitive test of goodness of fit. Nevertheless, an attempt has
been made to examine the discrepancies between the observations and the null hypothesis
expressed by equation (4). In Table 2 are compared the observed and expected frequencies
when the data are grouped according to the value of P. This is equivalent to a grouping
based on the value of Y, the expected probit in equation (8), and, as this quantity had been
evaluated for each observation in order to give P, it was used in the construction of Table 2.
Since three parameters have been estimated from the data, four groups is the least number
for giving a $\chi^2$ test. The limits of the groups were chosen so as to give similar numbers of
observations in each. Inspection of Table 2 shows that the groups are still too small for a
$\chi^2$ test to be trusted, thus suggesting that the data are inadequate for any useful test of goodness
of fit to be made. The only anomaly in Table 2 is the occurrence of two responses where the
expectation is 0.3, and this is clearly insufficient to cause much worry.
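
The grouped comparison can be illustrated numerically. This is a minimal sketch, assuming the Table 2 frequencies as read from the scan (two cells, the observed 5 and the expected 0.59, are inferred here from the row and column totals); the function and variable names are illustrative only.

```python
# (range of Y, observed -, observed +, expected -, expected +), as read from Table 2.
# The entries 5 and 0.59 are inferred from the printed totals, not read directly.
TABLE_2 = [
    ("below 4", 8, 2, 9.72, 0.28),
    ("4 to 5", 6, 0, 3.92, 2.08),
    ("5 to 6", 5, 8, 4.26, 8.74),
    ("above 6", 0, 10, 0.59, 9.41),
]


def grouped_chi_square(table):
    """Return (total chi-square, smallest expected frequency) for grouped data."""
    total = 0.0
    smallest_expectation = float("inf")
    for _, obs_neg, obs_pos, exp_neg, exp_pos in table:
        total += (obs_neg - exp_neg) ** 2 / exp_neg
        total += (obs_pos - exp_pos) ** 2 / exp_pos
        smallest_expectation = min(smallest_expectation, exp_neg, exp_pos)
    return total, smallest_expectation


if __name__ == "__main__":
    chi_square, smallest = grouped_chi_square(TABLE_2)
    print(f"chi-square = {chi_square:.2f}, smallest expectation = {smallest}")
```

Most of the total comes from the single cell with expectation 0.28, which is the situation the text warns against: with expectations this small the statistic cannot be referred to the $\chi^2$ distribution with any confidence.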

The inadequacy of the data for detecting any differences in sensitivity between the three
subjects may be seen from Table 3. The first nine entries in Table 1 relate to D.W., and
summation of the values of P gives the expected number of responses for this subject; similarly
the next eight and the last twenty-two entries give the numbers for V.P.W. and S.I.S.
respectively. Inspection of Table 1 shows that the tests on each subject were fairly widely
distributed over the range of values of $x_1$ and $x_2$. Table 3 shows excellent agreement between
totals of observed and expected responses for each subject, thus suggesting that any
individual differences that exist are small by comparison with the variation in sensitivity of
the same subject in different tests.

* In his discussion of the analysis of individual records, Bliss (1938) suggests adjustment of the $\chi^2$
test, not by altering the calculation of the statistic but by reducing the number of degrees of freedom
allotted to it; he gives an empirical rule for the reduction, based upon the expectations in terminal dose
groups. This method, however, not only lacks any theoretical basis, but seems liable to have an effect
opposite to that which is needed; it will attribute significance to high values of $\chi^2$ even more readily
than will the unadjusted test.


Table 2. Comparison of observed and expected frequencies of response

                 Observed frequencies         Expected frequencies
Range of Y        -        +      Total           -           +
   - 4            8        2        10          9.72        0.28
  4 - 5           6        0         6          3.92        2.08
  5 - 6           5        8        13          4.26        8.74
  6 -             0       10        10          0.59        9.41
  Total          19       20        39         18.49       20.51


Table 3. Comparison of subjects

                 Observed frequencies         Expected frequencies
Subject           -        +      Total           -           +
  B.W.            3        6         9           4.0         5.0
  V.P.W.          4        4         8           3.5         4.5
  S.I.S.         12       10        22          11.0        11.0
  Total          19       20        39          18.5        20.5


6. Limits of error

The variances of $b_1$ and $b_2$ and the covariance between them are respectively $v_{11}$, $v_{22}$ and $v_{12}$,
as defined in equation (7). Hence the variance of Y, the expected probit corresponding to
any pair of values $x_1$, $x_2$, is

$V(Y) = \frac{1}{Sw} + v_{11}(x_1 - \bar{x}_1)^2 + 2v_{12}(x_1 - \bar{x}_1)(x_2 - \bar{x}_2) + v_{22}(x_2 - \bar{x}_2)^2$,   (12)

where $Sw$ is the sum of the w column in Table 1. All these variances are derived from binomial
probability distributions. In the usual form of probit analysis, with a batch of subjects at
each dose, the precision of the estimated relationship between dose and response is discussed
as though the variation were normal, an assumption which is justifiable on account of the
large numbers of individuals involved. Here, with only thirty-nine observations in all, the
assumption is less safe, but may be adopted for lack of any more trustworthy method of
dealing with the data. It is unlikely to be seriously misleading, except possibly for extreme
levels of the response probability, P.

Equation (12) may now be used in the assignment of fiducial limits to any one of the curves
of equal probability given by equation (10). For suppose that t is the normal deviate
corresponding to the significance level to be used in defining the fiducial limits, and that $Y_0$ is the
probit of a probability $P_0$. Then for any values of $x_1$, $x_2$ for which

$(Y - Y_0)^2 > t^2 V(Y)$,

where Y is determined from equation (8), the expected probit differs significantly from $Y_0$,
and for values of $x_1$, $x_2$ which reverse the inequality the difference is not significant. Therefore
the equation

$(Y - Y_0)^2 = t^2 V(Y)$   (13)

gives the limiting values of ($x_1$, $x_2$) for which the null hypothesis that the true expected probit
is $Y_0$ is not untenable in the light of the data; in other words, equation (13) defines curves in
the ($x_1$, $x_2$) plane which are fiducial limits to the estimated locus of points having a constant
response probability $P_0$. These curves are clearly hyperbolae. In Figs. 2 and 3, the 5 %
fiducial limit curves (t = 1.960) for $P_0$ = 0.5 and $P_0$ = 0.9 respectively have been plotted in
the (V, R) plane; details of the calculation need not be given here, but Fig. 2, for example,
is derived from the equation

$(14.182 - 6.6843x_1 - 5.9400x_2)^2 = 3.841\{\frac{1}{Sw} + 4.7261(x_1 - 1.0495)^2 + 6.9878(x_1 - 1.0495)(x_2 - 1.2483) + 4.5145(x_2 - 1.2483)^2\}$.

Fig. 2. Fiducial limits (5 % probability) to 0.5 frequency contour of Fig. 1.
○ no vaso-constriction; ● vaso-constriction.

The pairs of curves are like hyperbolae in form. That for $P_0$ = 0.5 defines a band on either
side of the estimated relationship which is quite narrow for moderate values of V and R,
though naturally it widens considerably at the extremes. That for $P_0$ = 0.9, as might be
expected from general consideration of the problem, allows much greater uncertainty on the
side of high values of V and R; similarly, for $P_0$ = 0.1, the band would be relatively wider
on the side of low values of V or R.

Fig. 3. Fiducial limits (5 % probability) to 0.9 frequency contour of Fig. 1.
○ no vaso-constriction; ● vaso-constriction.

The curves shown in Figs. 1, 2 and 3 may be regarded as plane sections, for selected values
of Y, of a three-dimensional diagram relating Y to V and R. In terms of $x_1$, $x_2$ instead of
V, R, this diagram is the three-dimensional analogue of the familiar diagram showing a
regression line with hyperbolic curves indicating limits of error on either side; the line
generalizes to a plane, and the limits are now defined by two sheets of a hyperboloid, one
above and one below the plane.
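
The variance of equation (12) and the fiducial criterion of equation (13) can be sketched in a few lines. This is only an illustration under stated assumptions: the variances, covariance and means are those appearing in the numerical equation for Fig. 2, the regression constants are inferred from the same equation (14.182 = 5 - a), and `SUM_W` is a purely hypothetical stand-in for $Sw$, whose value is not legible in this copy.

```python
V11, V22 = 4.7261, 4.5145          # variances of b1 and b2 (from the Fig. 2 equation)
V12 = 6.9878 / 2.0                 # covariance; the equation prints 2 * v12
X1_MEAN, X2_MEAN = 1.0495, 1.2483  # weighted means of x1 and x2
A, B1, B2 = -9.182, 6.6843, 5.9400 # inferred from (14.182 - 6.6843 x1 - 5.9400 x2)
SUM_W = 16.0                       # HYPOTHETICAL value of Sw, for illustration only


def expected_probit(x1, x2):
    """Expected probit Y = a + b1*x1 + b2*x2."""
    return A + B1 * x1 + B2 * x2


def probit_variance(x1, x2, sum_w=SUM_W):
    """V(Y) in the form of equation (12)."""
    d1, d2 = x1 - X1_MEAN, x2 - X2_MEAN
    return 1.0 / sum_w + V11 * d1 * d1 + 2.0 * V12 * d1 * d2 + V22 * d2 * d2


def within_fiducial_limits(x1, x2, y0=5.0, t=1.960, sum_w=SUM_W):
    """True when (Y - Y0)^2 <= t^2 V(Y), i.e. the point is not excluded by (13)."""
    gap = expected_probit(x1, x2) - y0
    return gap * gap <= t * t * probit_variance(x1, x2, sum_w)


if __name__ == "__main__":
    print(probit_variance(X1_MEAN, X2_MEAN))   # smallest variance, at the means
    print(within_fiducial_limits(2.0, 2.0))    # far from the 0.5 contour
```

Because V(Y) grows quadratically with distance from the means, the band between the two limiting curves is narrowest near the centre of the data and widens at the extremes, as described for Figs. 2 and 3.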

The theoretical basis of the curves illustrated in Figs. 2 and 3 is perhaps insecure, but
undoubtedly they give a useful indication of the dependence of the probability of response
on V and R and of the reliability of the estimation of this relationship. Much as an
experimenter might wish for a more precise assessment of the effects of V and R, experience
suggests that results such as those obtained here are as good as can be expected from a total of
thirty-nine quantal observations.

7. The two-parameter equation

In § 3, the equation

$VR = \text{constant}$   (1)

was suggested as an expression of the curves of constant response probability, but the more
complex equation (2) was adopted for use in §§ 4-6. There are no theoretical reasons for
believing that equation (1) represents the true form of the relationship, and the more general
form was chosen in order that the complete calculations might be illustrated. The values of
$b_1$ and $b_2$ obtained, however, do not differ very greatly by comparison with the standard
error of their difference; in fact

$V(b_1 - b_2) = v_{11} - 2v_{12} + v_{22} = 2.253$,

and therefore $b_1 - b_2 = 0.744 \pm 1.501$.

In the absence of any significant difference between the regression coefficients, the common
scientific procedure of preferring the simpler hypothesis (Occam’s Razor) suggests that
equation (4) might be replaced by

$Y = a + b(x_1 + x_2)$.   (14)

For the estimation of equation (14), the computations are similar to, but shorter than, those
of § 4, since $(x_1 + x_2)$ may be replaced by a single variate, x, and a simple regression calculated;
the calculations in § 4 were used to give a first set of expected probits, from which was
derived the estimate

$Y = -9.475 + 6.4067(x_1 + x_2)$.   (15)

Only two parameters have been estimated from the data, and calculation as for equation (11)
gives

$\chi^2_{[37]} = 28.76$.

The difference between the two values of $\chi^2$ may be taken as a further criterion of whether or
not the extra parameter is needed, closely related to the test of significance of $(b_1 - b_2)$;

$\chi^2_{[1]} = 1.32$

is not significant, though again the validity of the test is in doubt.
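
The comparison of the two regression coefficients can be reproduced from the variances and covariance that appear in the numerical equation for Fig. 2. This is a small sketch under that assumption; the helper name is illustrative.

```python
import math


def difference_standard_error(v11, two_v12, v22):
    """Return (variance, standard error) of b1 - b2 from v11 - 2*v12 + v22."""
    variance = v11 - two_v12 + v22
    return variance, math.sqrt(variance)


if __name__ == "__main__":
    # Coefficients as printed in the equation from which Fig. 2 is derived.
    variance, std_error = difference_standard_error(4.7261, 6.9878, 4.5145)
    ratio = 0.744 / std_error  # observed b1 - b2 relative to its standard error
    print(round(variance, 3), round(std_error, 3), round(ratio, 2))
```

The difference is about half its standard error, far below the normal deviate 1.960 used elsewhere in the paper, which is consistent with the statement that the coefficients do not differ significantly.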

Substitution of the probit of a specified probability in equation (15) gives the value of the
constant in equation (1). For the 50 % response probability, for example, the constant is
1.82; over the range of values tested, the curves

$VR^{0.89} = 1.71$ and $VR = 1.82$

differ only slightly. Similarly, fiducial limits to $(x_1 + x_2)$ may be calculated, for any $Y_0$, as
upper and lower values of the product VR. No special interest attaches to these calculations;
the novelties due to the individual records are exactly as for the three-parameter equation
discussed in earlier sections, and otherwise the method is entirely that of ordinary probit
analysis (Finney, 1947, Chapter 4). For comparison with the three-parameter equation,
diagrams similar to Figs. 2 and 3 may be prepared; both the constant probability curves and
the fiducial limits are then true hyperbolae. Fig. 4 shows the results for a 50 % response
probability, and is to be compared with Fig. 2. The constant probability curve in Fig. 4
differs little from that in Fig. 2, though naturally the difference increases for large values of
V or R where the curves are less well determined. For moderate values of V and R, the fiducial
limit curves are practically the same as the corresponding curves in Fig. 2, but for more
extreme values they lie much closer to the curve of constant probability; since the data
show no significant difference between $b_1$ and $b_2$, it is to be expected that a more precisely
estimated relationship between stimulus and response will be obtained if an assumption that
$\beta_1 = \beta_2$ is made, so that the information on the two regression coefficients can be combined,
and this shows itself by narrowing the zone of error for the constant probability curve.

Fig. 4. Contour of dose-response surface for 0.5 frequency of response, estimated from two-parameter
equation, and its 5 % fiducial limits (compare Fig. 2). ○ no vaso-constriction; ● vaso-constriction.

8. Summary

The method of probit analysis has been developed to assist the study of the relationship 
between the magnitude of a stimulus and the proportion of tests in which a particular 
quantal response to that stimulus appears. In some research problems, the stimulus cannot 
be controlled sufficiently to make possible the administration of a specified magnitude, 
though the stimulus actually received by any one subject can later be measured. It will 
then seldom happen that two subjects receive exactly the same ‘dose’, and the data for
statistical analysis will generally consist of a series of doses with, for each, a statement of
whether or not a single subject showed the characteristic response.

Even for data of this type, the probit transformation can aid the estimation of the
relationship between dose and the probability of response. The calculations leading to the estimate
are more tedious than is usual in probit analysis, because of slow convergence from a
provisional equation to the final form, but follow the usual pattern. The validity of the $\chi^2$ test
of goodness of fit (in reality a test for the normality of distribution of individual tolerances)
must be doubted, however, since the disturbance due to small class numbers will be
encountered in its most extreme form. Extensive grouping of results for adjacent doses will
provide a test less open to objection, though this will generally be insensitive to all but the
grossest deviations from normality; indeed, no valid sensitive test is to be expected with
individual records unless these are very numerous.

In this paper, the calculations have been illustrated on data relating to a reflex
vaso-constriction which sometimes occurs in the skin of the digits of human subjects after a single
deep breath. The relationship between the occurrence of this response and two dose factors,
the volume and the rate of inspiration, has been estimated for the combined records from
three subjects; inclusion of two dose factors complicates the analysis, since a bivariate
regression equation must be fitted, but does not affect the underlying theory. The $\chi^2$ test
has been discussed at length, though there is no indication of non-normality or of
heterogeneity of the data. The reliability with which the dependence of the probability of response
on the dose factors is estimated has also been examined, and curves bounding fiducial 
regions, within which the true probability contours may confidently be asserted to lie, have 
been determined. This method of representing the limits of error is applicable to other forms 
of probit analysis involving two dose factors and is not restricted to individual records, 
though it has not previously been described. 

I am indebted to Mr R. W. Gilliatt, of the Department of Physiology, both for permission
to make use of his data in an illustration of the statistical methods of my paper and for
assistance in describing his experimental procedure. My thanks are due also to Miss M. Callow,
who prepared Figs. 1-4.


A POWER FUNCTION FOR TESTS OF RANDOMNESS
IN A SEQUENCE OF ALTERNATIVES

By F. N. DAVID


1. During recent years attention has been focused on what might be called the ‘group’
test for randomness in a sequence of alternatives. Thus, if E denote the happening of an
event, and $\bar{E}$ its negation, the number of alternations of E and $\bar{E}$ in a sequence supposedly
random has been chosen as a test criterion. This test has been put to different uses by
W. L. Stevens (1939), A. Wald & J. Wolfowitz (1940) and F. N. David (1947). It seems worth
while, therefore, to enquire what is the power of this test against a set of specifically defined
alternate hypotheses. The hypothesis to be tested will be that there is randomness within
the sequence, with the alternate hypothesis that if there is no randomness then there is
dependence of the type found in a simple Markoff chain. The same procedure will hold good
for dependence of the types found in more complex chains, although in these cases the
enumeration is a little troublesome.


2. If there is a sequence of dependent events

$E_1, E_2, E_3, \ldots, E_n$

then it is an elementary proposition of the probability calculus that

$P\{E_1 E_2 E_3 \ldots E_n\} = P\{E_1\}\, P\{E_2 \mid E_1\}\, P\{E_3 \mid E_1 E_2\} \ldots P\{E_n \mid E_1 E_2 \ldots E_{n-1}\}$.

If the events are independent, then

$P\{E_1 E_2 E_3 \ldots E_n\} = P\{E_1\}\, P\{E_2\}\, P\{E_3\} \ldots P\{E_n\}$.

This relation will be the basis of $H_0$, the hypothesis to be tested. If there is dependence as
in a simple Markoff chain, then mathematically each event will be dependent on the event
immediately preceding it, but will be independent of any of the other events. In this case
we shall have

$P\{E_1 E_2 E_3 \ldots E_{n-1} E_n\} = P\{E_1\}\, P\{E_2 \mid E_1\}\, P\{E_3 \mid E_2\} \ldots P\{E_n \mid E_{n-1}\}$.

This relation will be the basis of $H_1$, the hypothesis alternate to $H_0$.


3. For the hypothesis, $H_0$, let the probability that an event E will occur in a single trial
be p, and let the probability of $\bar{E}$ (the negation of E) be q, where p + q = 1. The probability
of obtaining any given sequence of $r_1$ E’s and $r_2$ $\bar{E}$’s will be

$p^{r_1} q^{r_2}$.

The number of ways in which $r_1$ E’s and $r_2$ $\bar{E}$’s may be arranged to form 2t and 2t + 1 sets of
E’s and $\bar{E}$’s alternately is

$f_{2t} = \frac{2(r_1 - 1)!\,(r_2 - 1)!}{\{(t - 1)!\}^2\,(r_1 - t)!\,(r_2 - t)!}$   and   $f_{2t+1} = f_{2t} \times \frac{r_1 + r_2 - 2t}{2t}$.

Writing k = 2t or 2t + 1 as desired, the probability of obtaining a sequence of $r_1$ E’s and $r_2$ $\bar{E}$’s
arranged in k sets is

$P\{k \mid r_1, r_2, H_0\} = \frac{f_k\, p^{r_1} q^{r_2}}{\sum_{\text{all } k} f_k\, p^{r_1} q^{r_2}} = \frac{f_k}{\sum_{\text{all } k} f_k}$.

k may take values 2, 3, ..., $2r_2$ if $r_1 = r_2$, and values 2, 3, ..., $2r_2 + 1$ if $r_1 > r_2$.

4. Following the orthodox procedure, in order to test the hypothesis, $H_0$, it is necessary
to find two numbers $k_1$ and $k_2$ such that

$P\{k < k_1 \mid H_0\} \le \tfrac{1}{2}\epsilon$,   $P\{k > k_2 \mid H_0\} \le \tfrac{1}{2}\epsilon$,

and therefore

$P\{k_1 \le k \le k_2 \mid H_0\} \ge 1 - \epsilon$,

where $\epsilon$ is a number arbitrarily at choice. If an observed number of sets, say k', falls outside
the limits $k_1$ and $k_2$ then the hypothesis $H_0$ will be rejected in favour of some alternate
hypothesis, $H_1$. Alternately, if $H_0$ is not true, but $H_1$ is, then

$1 - P\{k_1 \le k \le k_2 \mid H_1\}$

will be the power of the test in the sense of the word as used by Neyman & Pearson. Whether
$k_1$ or $k_2$ is chosen to judge the significance of an observed k' will depend on which departure
from randomness it is most important not to overlook. If the alternate hypothesis is that
there is positive dependence in the chain, i.e. that E having occurred in the sth trial it is
more likely to occur in the (s + 1)st trial, then $k_1$ would be chosen. Such a situation was
envisaged in a proposed smooth test to supplement the $\chi^2$ criterion (David, 1947). If, however,
the alternate hypothesis is that there is negative dependence, i.e. that E having occurred in
the sth trial, it is less likely to occur in the (s + 1)st trial, then $k_2$ would be the appropriate
criterion. If it is immaterial whether the departure from randomness is positive or negative
dependence, then both $k_1$ and $k_2$ may be used.
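
The counting argument of § 3 and the choice of a lower limit in § 4 can be sketched as follows. This is an illustrative implementation, assuming the standard combinatorial count of arrangements into k sets; the one-sided rule used here (largest $k_1$ whose cumulative probability does not exceed $\epsilon$) is an assumption made for the example, and the function names are not from the paper.

```python
from math import comb


def arrangements(k, r1, r2):
    """Number of orderings of r1 E's and r2 not-E's forming exactly k sets."""
    t, odd = divmod(k, 2)
    if t < 1:
        return 0
    if not odd:  # k = 2t: t sets of each kind, starting with either kind
        return 2 * comb(r1 - 1, t - 1) * comb(r2 - 1, t - 1)
    # k = 2t + 1: one kind forms t + 1 sets, the other t sets
    return comb(r1 - 1, t) * comb(r2 - 1, t - 1) + comb(r1 - 1, t - 1) * comb(r2 - 1, t)


def null_distribution(r1, r2):
    """P{k | r1, r2, H0}: every ordering of the r1 + r2 symbols is equally likely."""
    total = comb(r1 + r2, r1)
    counts = {k: arrangements(k, r1, r2) for k in range(2, r1 + r2 + 1)}
    return {k: c / total for k, c in counts.items() if c}


def lower_critical_value(r1, r2, eps=0.05):
    """Largest k1 with P{k <= k1 | H0} <= eps, or None if there is none."""
    cumulative, k1 = 0.0, None
    for k, prob in sorted(null_distribution(r1, r2).items()):
        cumulative += prob
        if cumulative > eps:
            break
        k1 = k
    return k1


if __name__ == "__main__":
    print(lower_critical_value(10, 10), lower_critical_value(7, 3))
```

Because the distribution of k is discrete, the attained level is generally below the nominal $\epsilon$; for short sequences with unequal $r_1$ and $r_2$ only the most extreme value of k may be available for rejection.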

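5. We now consider the alternate hypothesis, $H_1$. Write $E_s$ for the occurrence of the
event E in the sth trial and $\bar{E}_s$ for its negation. Let

$P\{E_1\} = P$,   $P\{\bar{E}_1\} = Q$,   $P + Q = 1$   and   $P \ge Q$,

$P\{E_s \mid E_{s-1}\} = p_1$,   $P\{\bar{E}_s \mid E_{s-1}\} = q_1$,

$P\{E_s \mid \bar{E}_{s-1}\} = p_2$,   $P\{\bar{E}_s \mid \bar{E}_{s-1}\} = q_2$.

Thus $p_1$ and $q_2$ are probabilities of no change and $q_1$ and $p_2$ probabilities of a change. If the
events are independent then

$p_1 = p_2 = P$   and   $q_1 = q_2 = Q$.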


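6. In calculating the probability of obtaining any given sequence, what will matter will
be the number of changes from E to $\bar{E}$ and back again. Let $f_t(r_1)$ be the number of ways in
which $r_1$ E’s can be arranged in t groups, i.e. let

$f_t(r_1) = \frac{(r_1 - 1)!}{(t - 1)!\,(r_1 - t)!}$.

If there are 2t groups in a sequence of $r_1$ E’s and $r_2$ $\bar{E}$’s, the number of ways of obtaining such
a sequence will be

$f_t(r_1)\, f_t(r_2)$

if the sequence starts with E or with $\bar{E}$. The probability of obtaining any given sequence of
$r_1$ E’s and $r_2$ $\bar{E}$’s of 2t groups will be

$P\, p_1^{r_1 - t}\, q_1^{t}\, p_2^{t-1}\, q_2^{r_2 - t}$   or   $Q\, p_1^{r_1 - t}\, q_1^{t-1}\, p_2^{t}\, q_2^{r_2 - t}$.

This follows from the fact that a sequence of 2t groups beginning with E will imply t changes
from E to $\bar{E}$ and t - 1 changes from $\bar{E}$ to E. The changes are reversed in number if the sequence
starts with $\bar{E}$. For 2t + 1 groups the number of ways of obtaining the sequence will be

$f_{t+1}(r_1)\, f_t(r_2)$   or   $f_t(r_1)\, f_{t+1}(r_2)$

according as the sequence begins with E or $\bar{E}$. The respective probabilities will be

$P\, p_1^{r_1 - t - 1}\, q_1^{t}\, p_2^{t}\, q_2^{r_2 - t}$   and   $Q\, p_1^{r_1 - t}\, q_1^{t}\, p_2^{t}\, q_2^{r_2 - t - 1}$.

The probability of obtaining a sequence of $r_1$ E’s and $r_2$ $\bar{E}$’s in 2t groups will be
therefore, under hypothesis $H_1$,

$P\{2t \mid r_1, r_2, H_1\} = \frac{f_t(r_1)\, f_t(r_2)\, p_1^{r_1 - t}\, q_2^{r_2 - t}\, (q_1 p_2)^{t-1}\, (P q_1 + Q p_2)}{D}$.

The probability of obtaining $r_1$ E’s and $r_2$ $\bar{E}$’s in 2t + 1 groups will be similarly

$P\{2t + 1 \mid r_1, r_2, H_1\} = \frac{(q_1 p_2)^{t}\, p_1^{r_1 - t - 1}\, q_2^{r_2 - t - 1}\, \{P q_2\, f_{t+1}(r_1) f_t(r_2) + Q p_1\, f_t(r_1) f_{t+1}(r_2)\}}{D}$,

where D is the sum of the numerators over all possible numbers of groups.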

7. So far no mention has been made of any possible connexion between $p_1$, $q_1$, $p_2$, $q_2$.
It is obvious in all cases we shall have

$p_1 + q_1 = 1$,   $p_2 + q_2 = 1$,

but the connexion between $p_1$ and $p_2$ is not immediate. We shall make the simplifying
assumption which is perhaps most closely related to practical problems, and shall state that where
nothing is known about the s - 1 trials preceding the sth trial, $P\{E_s\} = P$ and $P\{\bar{E}_s\} = Q$.
Under this assumption we have

$p_2 = \frac{P q_1}{Q}$,   $q_2 = 1 - \frac{P q_1}{Q}$.

This result is reached easily by noticing that

$P\{E_s\} = P\{E_s E_{s-1}\} + P\{E_s \bar{E}_{s-1}\} = P\{E_{s-1}\}\, P\{E_s \mid E_{s-1}\} + P\{\bar{E}_{s-1}\}\, P\{E_s \mid \bar{E}_{s-1}\}$,

whence $P = P p_1 + Q p_2$.
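
The relation between the transition probabilities can be checked numerically. This is a minimal sketch, assuming the notation in which $p_1$ and $p_2$ are the probabilities of E following E and following its negation respectively; the function name and the range check are illustrative additions.

```python
def transition_probabilities(P, p1):
    """Given P{E} = P and P{E | E before} = p1, return (q1, p2, q2)."""
    Q = 1.0 - P
    q1 = 1.0 - p1
    p2 = P * q1 / Q          # from P = P*p1 + Q*p2
    q2 = 1.0 - p2
    if not 0.0 <= p2 <= 1.0:
        raise ValueError("P and p1 are incompatible: p2 falls outside [0, 1]")
    return q1, p2, q2


if __name__ == "__main__":
    P, p1 = 0.6, 0.8
    q1, p2, q2 = transition_probabilities(P, p1)
    print(q1, p2, q2, P * p1 + (1.0 - P) * p2)  # the last value reproduces P
```

With $p_1 = P$ the function returns $p_2 = P$ and $q_1 = q_2 = Q$, which is the independent case of § 5.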


8. The alternative hypothesis chosen to illustrate the power function formulae is that
there is positive dependence in the sequence, i.e. $k_1$ is found so that

$P\{k \le k_1 \mid H_0\} \le \epsilon$   and   $1 - P\{k > k_1 \mid H_1\}$

is calculated, when $p_1 > P$. For economy of drawing, several power curves, or what are really
sections of a kind of power surface, plotted to coordinates P, $p_1$, have been put together in
the diagrams of Fig. 1. For example the bottom left-hand diagram shows for $r_1 = r_2 = 10$
sections of the conditional power surface for P = 0.5, 0.6 and 0.75. When $H_0$ is true and
$p_1 = P$ we have the 5 % risk of rejecting $H_0$ wrongly. As $p_1 - P$ increases the chance of
detecting the fact increases, but in a way dependent on P. The other three diagrams show
similar sections of the surfaces with $r_1 = 14$, $r_2 = 6$; $r_1 = r_2 = 5$; and $r_1 = 7$, $r_2 = 3$.

In practice it will not be known what the value of P is, but the curves show reasonably well
how the power of the test varies as P and $p_1$ (and therefore $p_2$) vary. It is clear that the test
for randomness under discussion is most powerful when the numbers of alternates are equal,
i.e. when $r_1 = r_2$. The power declines sharply when $r_1$ increases at the expense of $r_2$.
Another point which emerges is that the test is only moderately powerful, against the given
alternate hypothesis tested, when $r_1 + r_2 = 20$, and it would appear therefore that if it
was desired not to overlook a possible departure from randomness in the form of positive
dependence in the chain, then the length of the sequence should consist of at least 20 units.
The question of other possible tests we shall not discuss at this stage.
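
The conditional power against positive dependence can be computed by weighting each arrangement by its Markoff-chain probability. The sketch below is a reconstruction under stated assumptions, not the author's own computation: transition counts are derived from the number of groups and the starting symbol, $p_2$ is taken as $P q_1 / Q$, and the critical value is the largest $k_1$ whose null probability does not exceed $\epsilon$. All function names are illustrative.

```python
from math import comb


def groupings(r, t):
    """f_t(r): ways of splitting r like symbols into t non-empty groups."""
    return comb(r - 1, t - 1) if t >= 1 else 0


def group_distribution(r1, r2, P, p1):
    """P{k | r1, r2} for a simple Markoff chain; p1 = P gives independence."""
    Q, q1 = 1.0 - P, 1.0 - p1
    p2 = P * q1 / Q
    q2 = 1.0 - p2
    weights = {}
    for k in range(2, r1 + r2 + 1):
        t, odd = divmod(k, 2)
        if odd:   # (ways, first trial, E->E, E->notE, notE->E, notE->notE)
            cases = [(groupings(r1, t + 1) * groupings(r2, t), P, r1 - t - 1, t, t, r2 - t),
                     (groupings(r1, t) * groupings(r2, t + 1), Q, r1 - t, t, t, r2 - t - 1)]
        else:
            ways = groupings(r1, t) * groupings(r2, t)
            cases = [(ways, P, r1 - t, t, t - 1, r2 - t),
                     (ways, Q, r1 - t, t - 1, t, r2 - t)]
        weight = sum(ways * first * p1 ** a * q1 ** b * p2 ** c * q2 ** d
                     for ways, first, a, b, c, d in cases if ways)
        if weight > 0:
            weights[k] = weight
    norm = sum(weights.values())
    return {k: w / norm for k, w in weights.items()}


def conditional_power(r1, r2, P, p1, eps=0.05):
    """P{k <= k1 | H1}, with k1 the largest value having null probability <= eps."""
    null = group_distribution(r1, r2, P, P)
    cumulative, k1 = 0.0, None
    for k in sorted(null):
        cumulative += null[k]
        if cumulative > eps:
            break
        k1 = k
    if k1 is None:
        return 0.0
    alternative = group_distribution(r1, r2, P, p1)
    return sum(prob for k, prob in alternative.items() if k <= k1)


if __name__ == "__main__":
    for p1 in (0.5, 0.6, 0.7, 0.8, 0.9):
        print(p1, round(conditional_power(10, 10, 0.5, p1), 3))
```

Setting $p_1 = P$ returns the attained size of the test, and increasing $p_1$ traces out one section of the conditional power surface for the chosen P, $r_1$ and $r_2$.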



[Fig. 1: four panels of power curves. Ordinate: power of test; abscissa: scale for P and $p_1$.
Legible panel titles: sequence of 20, $r_1 = 14$; sequence of 10, $r_1 = 7$, $r_2 = 3$.]

Fig. 1. Conditional power curves when the alternate hypothesis is positive dependence.

9. It will be noticed that $P\{2t \mid r_1, r_2, H_1\}$ or $P\{2t + 1 \mid r_1, r_2, H_1\}$, which have been loosely
termed power function formulae, are not power functions in the sense originally defined by Neyman &
Pearson, but they appear to involve a justifiable extension of that idea. In order to
distinguish them from the usual meaning of the words power function, I shall refer to them as
conditional power functions. The theory of the conditional power function may be stated
briefly in the following way. It is assumed that all possible samples (or sequences) may be
classified according to their composition. Suppose that there are k of these mutually
exclusive classes, which are also the only possible, say $C_1, C_2, \ldots, C_k$. We have considered only the
case where k is finite but it appears likely that the method can be extended to cover the case
where k is enumerably infinite. These classes, $C_1, C_2, \ldots, C_k$, will correspond to regions forming
a partition of the sample space.

Let $H_0$ be the hypothesis tested and $w_0$ be the critical region used for the rejection of this
hypothesis. Given that a sample is in $C_i$ (say), and that an alternate hypothesis $H_1$ is true,
then the probability that $H_0$ will be rejected is

$P\{E \in w_0 C_i \mid E \in C_i, H_1\}$,

where $w_0 C_i$ means the region common to $w_0$ and to $C_i$ and, following the Neyman-Pearson
notation, E is the sample point. Regarded as a function of $H_1$ this is the conditional power
function of the test associated with $w_0$ in the subset $C_i$ of samples.

The Neyman-Pearson power function, which we might call here the overall power function,
will be

$P\{E \in w_0 \mid H_1\} = \sum_{i=1}^{k} P\{E \in w_0 C_i \mid H_1\} = \sum_{i=1}^{k} P\{E \in w_0 C_i \mid E \in C_i, H_1\}\, P\{E \in C_i \mid H_1\}$,

which may be looked on as a weighted average of the conditional power functions.

10. There seems to be no reason why $w_0$ should not be built up of portions $w_0 C_i$, these
portions being chosen to maximize each term of the summation, i.e. $w_0 C_i$ chosen to maximize
the conditional power function. For example, to revert to the specific case of randomness
within a sequence with which we have been dealing, the different partitions of r (= $r_1 + r_2$) are
the mutually exclusive and only possible classes $C_i$. It is conceivable, although practically
not very likely, that for each of these classes there will exist a different test which is more
powerful to detect specifically defined departures from the basic hypothesis tested than
any other test. The decision as to which is the most powerful test, against the same specifically
defined alternatives, to use for any given class will be decided by the conditional power
function. Once this has been decided the procedure for the complete test of significance may
be laid down. This will be: (i) count the number of alternatives in the sequence, i.e. find $r_1$
and $r_2$; (ii) from (i) decide the appropriate test of significance to use; (iii) apply the test. The
power of the test as laid down by (i), (ii) and (iii), in the usual meaning of the word, will be
given by the overall power function.

It is proposed to discuss these, and other applications of the conditional power function 
technique, in a further publication. I have been concerned here with trying to explain what 
I believe to be the basic ideas, and to forestall possible criticism that I am falling into error 
(of the third kind) and am choosing the test falsely to suit the significance of the sample.


A NUMERICAL SOLUTION OF THE PROBLEM OF MOMENTS

By H. O. HARTLEY and S. H. KHAMIS

1. Introduction

Given a statistical variable x and its frequency distribution f(x), then, under certain
continuity conditions for f(x), the moments

$\mu_r = \int x^r f(x)\,dx$   (r = 0, 1, 2, ...)   (1)

can be evaluated for any integer r. For certain distributions f(x) the integrations in (1) can
be carried out analytically, resulting in simple formulae for the moments. In general there
is no inherent difficulty in obtaining numerical values for the moments by numerical
quadrature.
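
The forward problem, obtaining moments from a known distribution by numerical quadrature, can be sketched as follows. The density f(x) = 6x(1 - x) on the range 0 to 1 is an assumed example chosen because its moments, 6/((r + 2)(r + 3)), are known exactly; Simpson's rule is one convenient quadrature and is not prescribed by the text.

```python
def simpson(func, a, b, intervals=200):
    """Composite Simpson's rule on [a, b] with an even number of intervals."""
    if intervals % 2:
        intervals += 1
    h = (b - a) / intervals
    total = func(a) + func(b)
    for i in range(1, intervals):
        total += (4 if i % 2 else 2) * func(a + i * h)
    return total * h / 3.0


def moment(density, r, a, b):
    """r-th moment about the origin: integral of x^r f(x) over [a, b]."""
    return simpson(lambda x: x ** r * density(x), a, b)


def example_density(x):
    """Assumed example: f(x) = 6x(1 - x) on [0, 1]."""
    return 6.0 * x * (1.0 - x)


if __name__ == "__main__":
    for r in range(4):
        print(r, round(moment(example_density, r, 0.0, 1.0), 6))
```

The inverse problem treated in the paper runs in the opposite direction: only the values printed by such a computation are known, and the distribution itself is to be recovered.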

The inverse problem is to find the distribution f(x) given the moments $\mu_r$. This problem,
commonly known as ‘The Problem of Moments’, has received considerable attention by
mathematicians and is of interest in statistical distribution theory. There are numerous
statistics for which it is difficult to obtain a formula of the random sampling distribution
f(x) amenable to numerical evaluation. On the other hand, in such cases it is often possible
to find simple formulae for the random sampling moments (Bartlett, 1937). Sometimes such
formulae are available for all integer r; more often than not, however, $\mu_r$ is only known for
a limited number of small r (e.g. r = 0, 1, ..., 6). A simple method of ‘determining’ f(x) from
the given moments would therefore be helpful in such cases.

Examples of variables of this kind are the numerous moment statistics or k-statistics for
which random sampling moments can be evaluated, notably by R. A. Fisher’s (1929, 1930)
combinatorial methods, whilst their exact sampling distributions are usually unknown.
As related statistics we should mention here the moment ratios $\sqrt{b_1}$ and $b_2$ used in tests
for deviation from normality (Geary, 1947; Geary & Worlledge, 1947). For these, the
low-order moments are known exactly. A similar situation arises with statistics defined as
likelihood ratios, as, for instance, with the criterion required for testing heterogeneity
in a set of variances. Moments for this statistic were obtained by Neyman & Pearson as early
as 1931, yet, although approximations have been obtained (Bartlett, 1937; Hartley,
1940; Nayer, 1936; Neyman & Pearson, 1931; Sukhatme, 1936; Welch, 1935, 1936), there
is still considerable doubt about their accuracy in certain cases, and the exact formula
obtained by Nair (1936) in the case of equal sample sizes is very complex.

These and numerous other problems of distribution point to the necessity of developing 
a numerical technique to deal with the following situation: 

(i) A random variable x ranging between a and b (where a may be $-\infty$ and b may be $+\infty$)
has a distribution function f(x) known to have a continuous derivative of order n.

(ii) The moments

$\mu_r = \int_a^b x^r f(x)\,dx$   (r = 0, 1, ..., R),   (2)

are known numerically to any decimal accuracy desired but for a limited number of positive
integers r, viz. r = 0, 1, ..., R. With the knowledge about f(x) limited to the above conditions,
is it possible to obtain numerical values for the probability integral $P(x) = \int_a^x f(x)\,dx$
depending on the moments only, and is it possible to make a statement on the accuracy of
these values in terms of the derivatives of the function f(x)?

Problems of this kind have hitherto been treated principally in two ways: 

(a) When R = 2, 3 or 4 nothing better can be expected than a ‘good fit’, which is often
achieved by fitting the appropriate Pearson-type curve.

(b) With R in the neighbourhood of 5-8, expansions of the Gram-Charlier, Laguerre or
Jacobi type have been used, either as cumulant or as moment expansions. Such theorems
as are available for statements on the convergence and asymptotic behaviour of these
expansions usually require too many moments to be known. Often the expansions are only
asymptotic, and unless the distribution is close to the generating curve (Normal for
Gram-Charlier, $\Gamma$ for Laguerre), the results are often disappointing (see, for example, Kendall,
1945, Chapter 6).

2. Outline of present method

The method to be developed here is a direct application of finite-difference calculus and
therefore provides both numerical answers to the problem and gauges of their accuracy
in the form of remainder terms. The method is, in fact, closely linked with interpolation technique.
When using any of the well-known interpolation formulae no mathematically rigorous
statement on the accuracy of the interpolates can be made unless the magnitude of the
remainder term can be estimated, and for this some knowledge about (say) the nth derivative
of the function is required. Yet, in using such formulae the convergence of the difference
table inspires confidence that ‘the results of the interpolation can be accepted as a working
hypothesis’ (Milne-Thomson, 1933, p. 62). Similarly, with the present method we shall give
a numerical procedure for obtaining values of the probability integral. Certain checks of
internal consistency will be described which inspire confidence that the answers are correct,
but no rigorous statement on the accuracy can, of course, be made if this is to be based on
a finite number of moments alone. The exact remainder terms which we derive will entail
the high-order derivatives of f(x), and it is hoped, in a second communication, to derive
some general statements concerning their order of magnitude.

In order to simplify the argument we assume in this section that the range of x is finite
(a and b finite).

The aim is to determine the probability integral of x, $P(x) = \int_a^x f(x)\,dx$, in tabular form,
i.e. we wish to determine numerical values of

$P(x_i) = \int_a^{x_i} f(x)\,dx$   (3)

for discrete values of $x_i$. For convenience the group intervals will generally be chosen
equidistant (group interval = h), and the number of intervals will be R + 1, i.e. equal to the
number of given moments (including $\mu_0 = 1$). Hence

$x_i = a + ih$   (i = 0, 1, ..., R + 1),   $h = (b - a)/(R + 1)$.   (4)

The first differences in the table derived from equation (3) are the quantities

$f_i = P_i - P_{i-1} = \int_{x_{i-1}}^{x_i} f(x)\,dx$,   (5)

and are the familiar ‘ frequencies ’ in a grouped frequency distribution with equidistant 
intervals (see Fig. 1). The link between these frequencies and the exact momentsis then 
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established by the well-known formulae for Sheppard’s correction. Using Kendall’s (1938, 
1945) derivation and remainder term, but extending his notation, we have 

iJ+i 

S IS = ^0, (6) 

where the centre points are given by 

+ (i = + (7) 

(7(r, h) denotes Sheppard’s corrective term, viz. 

Hr](hYJ/r\ 1 

‘'(’■•‘1=2.( 2 ) '»> 

and S{r^ h) the remainder term. 





The aim, now, is to use equations (6) to determine the unknown $f_i$ from the given $\mu'_r$.
To this end the remainder S(r, h) must be examined. Most distribution functions have what
is commonly known as high contact at the terminals of the variate range. This means
that f(x), as well as all its derivatives up to order, say, m, vanish at both ends of the range, i.e.

$$f^{(i)}(a) = f^{(i)}(b) = 0 \qquad (i = 0, 1, \ldots, m). \qquad (9)$$

If for such functions we define f(x) = 0 outside the range $a \le x \le b$, it will have continuous
derivatives of up to order m for $-\infty < x < +\infty$. It can then be shown that the remainder
term is of the form (see, for example, Kendall, 1945, p. 69)

A) = - h, 6;} {m even), (10) 

uh i 

SAr. h) = .BW+i(i) kM^r, h, &,) (modd), (11) 

a^dj,<:h, 

where the $B_m$ are the Bernoulli numbers, the $B_m(x)$ are the Bernoulli polynomials of first order,
the integrand function k(r, h, x) is defined by

$$k(r, h, x) = x^r \int_{x - \frac12 h}^{x + \frac12 h} f(t)\,dt, \qquad (12)$$

and its derivatives with regard to x are denoted by $k^{(m)}(r, h, x)$. In the subsequent sections we shall
assume (9) to hold (contact of order m), but will discuss the case when (9) is not satisfied
in § 10.

The remainder term $S_m(r, h)$ will usually be small (see, for example, Kendall, 1945, p. 72).
We shall therefore, in what follows, ignore $S_m(r, h)$ but will discuss the error thereby
committed in § 5.

If, then, in (6) we omit S(r, h) we obtain a system of R + 1 linear equations for the R + 1
unknowns $f_i$:

$$\sum_{i=1}^{R+1} f_i\,\xi_i^{\,r} = \mu'_r + C(r, h) \qquad (r = 0, 1, \ldots, R). \qquad (13)$$

The matrix of this system of equations ($V_R$, say) is of the form $[\xi_i^{\,r}]$ and has a classical
determinant $|\xi_i^{\,r}|$, sometimes referred to as Vandermonde's determinant and well known to be
$\ne 0$. The system can therefore be inverted once and for all and, for any particular case, the
unknown $f_i$ can then be determined by substituting the right-hand sides of (13), i.e. $\mu'_r + C(r, h)$, in
the inverse matrix $V_R^{-1}$. Denoting the elements of this inverse matrix by $U_{ir}$ we have the
system of equations

$$f_i = \sum_{r=0}^{R} U_{ir}\,[\mu'_r + C(r, h)]. \qquad (14)$$

Progressive addition of the $f_i$ yields the $P_i$ from $P_i = \sum_{j \le i} f_j$ and therefore a table of P(x) at
interval h.* Finally, intermediate values of P(x) can be obtained by standard interpolation.
Alternatively, as described in § 7, we may obtain directly a table of P(x) at interval $\tfrac12 h$.
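The inversion step of this section can be sketched numerically as follows. This is a minimal illustration only, assuming NumPy is available; the function name `frequencies_from_moments` and the five made-up frequencies are hypothetical and serve merely to show that solving the Vandermonde system recovers the $f_i$ from the (corrected) grouped moments.

```python
import numpy as np

def frequencies_from_moments(centres, corrected_moments):
    """Solve sum_i f_i * centres[i]**r = corrected_moments[r] for r = 0..R."""
    centres = np.asarray(centres, dtype=float)
    rhs = np.asarray(corrected_moments, dtype=float)
    # Row r of V holds the r-th powers of the centre points (Vandermonde form).
    V = np.vander(centres, N=len(centres), increasing=True).T
    return np.linalg.solve(V, rhs)

# Illustration with made-up frequencies on five centre points (not from the paper).
centres = np.array([-2.0, -1.0, 0.0, 1.0, 2.0])
f_true = np.array([0.05, 0.25, 0.40, 0.25, 0.05])
grouped = np.array([np.sum(f_true * centres**r) for r in range(5)])

f = frequencies_from_moments(centres, grouped)
P = np.cumsum(f)  # progressive addition gives the table of P at interval h
print(f, P)
```

In practice the right-hand sides would be the given moments plus Sheppard's corrective terms, and the matrix would be inverted once for each R rather than solved afresh.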


3. The standard form of the numerical inversion

The rank of the original matrix $V_R$ is obviously equal to R + 1, i.e. the number of moments
given, whilst its elements are the powers of the centre points $\xi_i$. It is desirable therefore that,
for any given R, scale and location of the variable x be transformed into a standard form X,
so that only one matrix $V_R$ and therefore only one matrix $V_R^{-1}$ need be calculated for each R.
It is most convenient to standardize as follows:

$$X = (x - \tfrac12(a + b))\,\frac{R + 1}{b - a} \qquad (R \text{ even}), \qquad (15)$$

$$X = (x - \tfrac12(a + b))\,\frac{R + 1}{b - a} + \tfrac12 \qquad (R \text{ odd}). \qquad (16)$$

It will be seen, therefore, that the range of X is R + 1 and the group interval

$$H = X_{i+1} - X_i = 1.$$

From the given moments of x those of X ($M_r$, say) about X = 0 can, of course, be calculated
by the usual binomial formulae, and in what follows we assume that values of $M_r$ are given
numerically. Further, in analogy to (13), we have

$$\bar M_r = M_r + C(r, 1). \qquad (17)$$

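From (15) and (16) we obtain for the new centre points

$$X_i = -\tfrac12 R,\ \ldots,\ 0,\ \ldots,\ +\tfrac12 R \ \text{ for even } R; \qquad X_i = -\tfrac{R-1}{2},\ \ldots,\ 0,\ \ldots,\ \tfrac{R+1}{2} \ \text{ for odd } R, \qquad (18)$$

and the matrix $V_R$ becomes $[X_i^{\,r}]$. Thus, if the first six moments
are given, we obtain for $V_6$

$$V_6 = \begin{pmatrix}
1 & 1 & 1 & 1 & 1 & 1 & 1 \\
-3 & -2 & -1 & 0 & 1 & 2 & 3 \\
9 & 4 & 1 & 0 & 1 & 4 & 9 \\
-27 & -8 & -1 & 0 & 1 & 8 & 27 \\
81 & 16 & 1 & 0 & 1 & 16 & 81 \\
-243 & -32 & -1 & 0 & 1 & 32 & 243 \\
729 & 64 & 1 & 0 & 1 & 64 & 729
\end{pmatrix}. \qquad (19)$$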




* It is, of course, possible to construct a matrix yielding the $P_i$ directly from the $\mu'_r$, but we are here
satisfied with determining the $f_i$ first, as they are of independent interest.
In practice the important range of R will be from 5 to about 8. The inverse matrix $V_6^{-1}$ is
given below, and it is hoped to give $V_5^{-1}$, $V_7^{-1}$ and $V_8^{-1}$ in a subsequent paper. The inverse
matrix $V_6^{-1}$, the elements of which are denoted by $U_{ir}$, can be written in the form

$$c_i f_i = \sum_{r=0}^{6} u_{ir}\,\bar M_r, \qquad (20)$$

where $U_{ir} = u_{ir}/c_i$, i.e. the $c_i$ are suitable common denominators of the $U_{ir}$, and the $u_{ir}$
are given in the body of the schedule below:

                 Multiplier u_ir of column
  i    c_i    Mbar_0 = 1   Mbar_1   Mbar_2   Mbar_3   Mbar_4   Mbar_5   Mbar_6
               (r = 0)      (1)      (2)      (3)      (4)      (5)      (6)

  1    720        0         -12        4       15       -5       -3        1
  2    120        0          18       -9      -20       10        2       -1
  3     48        0         -36       36       13      -13       -1        1
  4     36       36           0      -49        0       14        0       -1      (21)
  5     48        0          36       36      -13      -13        1        1
  6    120        0         -18       -9       20       10       -2       -1
  7    720        0          12        4      -15       -5        3        1



In order to use the above system of equations it would be necessary to compute the $\bar M_r$ from
the given $M_r$, using formula (17). It is obviously more convenient to evaluate, once and for
all, a matrix $U'_{ir}$ giving the $f_i$ directly in terms of the given $M_r$. This matrix is given below
for R = 6:


  i         M_0 = 1      M_1         M_2         M_3         M_4         M_5         M_6

  1  f_1    0.000379   -0.011719    0.002344    0.017361   -0.005208   -0.004167    0.001389
  2  f_2   -0.005227    0.109375   -0.034896   -0.152778    0.072917    0.016667   -0.008333
  3  f_3    0.059161   -0.683594    0.618490    0.253472   -0.244792   -0.020833    0.020833
  4  f_4    0.891373    0          -1.171875    0           0.354167    0          -0.027778
  5  f_5    0.059161    0.683594    0.618490   -0.253472   -0.244792    0.020833    0.020833
  6  f_6   -0.005227   -0.109375   -0.034896    0.152778    0.072917   -0.016667   -0.008333
  7  f_7    0.000379    0.011719    0.002344   -0.017361   -0.005208    0.004167    0.001389

                                                                                        (22)

Working rule: Each $f_i$ is obtained by forming the sum of seven products using the seven coefficients in
the ith line and applying them to $M_0, \ldots, M_6$, e.g. $f_1 = 0.000379 M_0 - 0.011719 M_1 + \ldots + 0.001389 M_6$.
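As a check on the two schedules, the inverse of the Vandermonde matrix for R = 6 and its combination with Sheppard's corrections can be recomputed exactly. The sketch below is an illustration under stated assumptions: it takes the rows of the inverse to be the coefficients of the Lagrange polynomials on the centre points -3, ..., 3, and it takes the corrective term with h = 1 in the form C(r, 1) = sum over j of binom(r, 2j) (1/2)^(2j) M_(r-2j) / (2j + 1). The helper names `poly_mul`, `lagrange_row` and `sheppard` are hypothetical.

```python
from fractions import Fraction
from math import comb

R = 6
nodes = list(range(-3, 4))  # centre points X_i for R = 6

def poly_mul(p, q):
    """Multiply two polynomials given as coefficient lists (lowest power first)."""
    out = [Fraction(0)] * (len(p) + len(q) - 1)
    for a, pa in enumerate(p):
        for b, qb in enumerate(q):
            out[a + b] += pa * qb
    return out

def lagrange_row(i):
    """Coefficients of the polynomial equal to 1 at nodes[i] and 0 at the other nodes."""
    poly = [Fraction(1)]
    for j, xj in enumerate(nodes):
        if j != i:
            poly = poly_mul(poly, [Fraction(-xj), Fraction(1)])
            poly = [c / (nodes[i] - xj) for c in poly]
    return poly

# Row i of the inverse Vandermonde matrix: f_i = sum_r U[i][r] * Mbar_r.
U = [lagrange_row(i) for i in range(R + 1)]

def sheppard(r, s):
    """Coefficient of M_s in Mbar_r = M_r + C(r, 1) (assumed form, h = 1)."""
    j2 = r - s
    if j2 < 0 or j2 % 2:
        return Fraction(0)
    return Fraction(comb(r, j2), 2**j2 * (j2 + 1))

# Combined matrix: f_i = sum_s U_prime[i][s] * M_s.
U_prime = [[sum(U[i][r] * sheppard(r, s) for r in range(R + 1))
            for s in range(R + 1)] for i in range(R + 1)]

for row in U_prime:
    print("  ".join(f"{float(c):10.6f}" for c in row))
```

Exact rational arithmetic is used here so that the printed six-decimal coefficients are free of rounding accumulated during the inversion.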


4. Calculation of the incomplete B-function $I_x(8, 6)$
from its first six moments

As an example for the above method we consider the Beta distribution for p = 8 and
q = 6, i.e. $f(x) = [B(8, 6)]^{-1} x^7 (1 - x)^5$.

Using the moments for this distribution about x = 0, $\mu'_r = B(8 + r, 6)/B(8, 6)$ (r = 0, ..., 6),
and transforming to the standard scale X = 7x - 3.5, we obtain for the moments of X about
X = 0: $M_1 = 0.5$, $M_2 = 1.05$, $M_3 = 1.225$, $M_4 = 2.77426$, $M_5 = 4.41360$ and $M_6 = 10.56942$.

Substituting these in the matrix (22) we obtain values of $f_i$ whose progressive sums are
shown in Table 1 (calculated $I_x(8, 6)$). These may be compared with the 'exact' values obtained
(by interpolation) from the Tables of the Incomplete B-function (1934). The worst discrepancy
is about 2 in the fourth decimal. Higher accuracy can, of course, be obtained if the number
of moments (R + 1) and therefore the number of $f_i$ increases (see, for example, § 8, where
the normal curve is obtained to 5-decimal accuracy).

A rather gratifying feature of the comparison is the higher decimal accuracy in the tails
of the distribution. This is a consequence of the sensitivity of the higher moments to changes
in the tail frequencies. Note also that the elements in the top and bottom lines of the inverse
matrix (22) are much smaller than those in the other lines, so that any error in the right-hand
sides of (13) has a smaller effect on the terminal $f_i$.


Table 1. Comparison of 'calculated' and 'exact' values of $I_x(8, 6)$

    X       x      Exact      Calculated   Difference x 10^5

  -2.5     1/7    0.00011     0.00009            2
  -1.5     2/7    0.01341     0.01354          -13
  -0.5     3/7    0.14017     0.13995           22
   0.5     4/7    0.48963     0.48981          -18
   1.5     5/7    0.86261     0.86270           -9
   2.5     6/7    0.99411     0.99395           16
   3.5     7/7    1.00000     1.00000            0


It might be argued that a further error will arise when determining intermediate values
of $I_x$ by interpolation in the 'calculated' table. This difficulty could, however, be overcome
by shifting the grid of group intervals and using a standard X-scale with group end-points
corresponding to the odd multiples of 1/14 in x, thereby obtaining $I_x$ at points half-way
between the arguments of Table 1. Such a method has actually been used in § 7.
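The calculation of this section can be retraced in a few lines. The sketch below is illustrative and assumes NumPy; instead of the tabulated matrix it solves the Vandermonde system directly, with Sheppard's correction taken in the assumed form C(r, 1) = sum over j of binom(r, 2j) (1/2)^(2j) M_(r-2j) / (2j + 1). Small differences from the printed table in the fifth decimal are to be expected, since the table was formed with six-decimal coefficients.

```python
import numpy as np
from math import comb

R, p, q = 6, 8, 6

# Moments about x = 0: B(p + r, q) / B(p, q) = prod_{k < r} (p + k) / (p + q + k).
mu = [1.0]
for k in range(R):
    mu.append(mu[-1] * (p + k) / (p + q + k))

# Moments of X = 7x - 3.5 about X = 0 by the binomial formula.
M = [sum(comb(r, k) * 7.0**k * (-3.5)**(r - k) * mu[k] for k in range(r + 1))
     for r in range(R + 1)]

# Sheppard-corrected moments Mbar_r = M_r + C(r, 1) (assumed form, h = 1).
Mbar = [sum(comb(r, 2 * j) * 0.5**(2 * j) / (2 * j + 1) * M[r - 2 * j]
            for j in range(r // 2 + 1)) for r in range(R + 1)]

centres = np.arange(-3.0, 4.0)                 # centre points -3, ..., 3
V = np.vander(centres, increasing=True).T      # rows are powers 0..6 of the centres
f = np.linalg.solve(V, np.array(Mbar))
P = np.cumsum(f)                               # calculated I_x(8, 6) at x = 1/7, ..., 7/7

for i, value in enumerate(P, start=1):
    print(f"x = {i}/7   calculated I_x(8, 6) = {value:.5f}")
```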

5. The remainder term

A formal representation of the remainder term is immediately obtained by reverting to the
exact equations (6). If we are concerned with distribution functions having contact of order
m at the terminals, the error contributions to the $f_i$ are obtained by substituting the R + 1
remainder terms ((10), (11)) in the inverse matrix $V_R^{-1}$. It is convenient to use the
standard variate X-scale, H = 1 and the matrix $V_R^{-1}$, when it will be found that

$$\text{error } f_i = \sum_{r=0}^{R} U_{ir}\,S_m(r, 1), \qquad (23)$$

where $S_m(r, 1)$ is given by (10) or (11) putting h = 1 and remembering that the integrand
function k must be taken in terms of the standard variate X, viz.

(24) 

Since the arguments $\theta_r$ of $k^{(m)}(r, 1, X)$ are unknown it will as a rule be necessary to substitute
their respective maxima in (23), at the same time taking $|U_{ir}|$ in place of $U_{ir}$.

Although with (23) we have given a formal solution of the error term involved, in a manner 
similar to the remainder terms of interpolation formulae, it will in practice be difficult to 
estimate the magnitude of the error from this formula. It is hoped, therefore, to go into this 
aspect more fully in a second paper. 

6. Infinite variate range and artificial truncation

When the range of the variate is infinite, i.e. when $a = -\infty$ and/or $b = +\infty$, it is, of course,
possible to transform the variate x by, say, y = y(x) such that the range of y is finite. However,
in general, we shall not be able to assume that the moments of y are known or that they can
be derived from those of x. It is therefore necessary to adapt our method to deal with an
infinite variate range. We shall treat here the case $b = +\infty$, the case $a = -\infty$ being identical
and the case $a = -\infty$ and $b = +\infty$ being analogous.

For an infinite variate range, the condition of high contact is now replaced by

$$\lim_{x \to \infty} x^{R} f^{(i)}(x) = 0 \qquad (i = 0, 1, \ldots, m), \qquad (25)$$

which results in remainder terms analogous to (10) and (11)*. Similarly, in equations (6),
which correspond to Kendall's (1945) equations (3.40), the summation now extends from
i = 1 to $i = \infty$, there being an infinity of frequencies $f_i = \int_{x_{i-1}}^{x_i} f(x)\,dx$. Now since the $\mu'_r$
exist we know that

$$\sum_{i=1}^{\infty} f_i\,\xi_i^{\,r} \qquad (26)$$

is convergent. Accordingly

$$\lim_{b \to \infty} \sum_{i=R+2}^{\infty} \xi_i^{\,r} \int_{x_{i-1}}^{x_i} f(x)\,dx = 0, \qquad (27)$$

with h = (b - a)/(R + 1). If, therefore, we denote the above sums by e(r, b) respectively we have,
from (6),

$$\sum_{i=1}^{R+1} f_i\,\xi_i^{\,r} + e(r, b) = \mu'_r + C(r, h) + S(r, h). \qquad (28)$$

Applying now the previous method we introduce an additional error in the calculation of $f_i$,
but this error is smaller than $\sum_r |e(r, b)|\,|U_{ir}|$.


The precise determination of the e(r, b) for any given b would, of course, require a knowledge
of the nature of the convergence in (26), i.e. some external knowledge about the distribution
f(x) which we are seeking to determine numerically. Unfortunately, such knowledge will
in general not be available.

However, if b is chosen sufficiently large, the $f_i$ determined for different values of b should
all yield, by the method of §§ 2 and 3, approximations to the same probability integral P(x)
to within the errors of the respective remainder terms S(r, h) and to within the errors
introduced by (27). In practice, therefore, one would make an intelligent guess at the likely
range of b and then test for internal consistency by comparing the probability integral tables
obtained by varying b over this range. This method, which is illustrated in § 7, gives an idea
of the accuracy to which the integral has been determined, but no rigorous statement on
accuracy can be made without appealing to some a priori knowledge about f(x). It is hoped
to deal with this aspect more fully in the next paper.


7. The calculation of the $\chi$-distribution for 10 degrees of freedom

As an illustration of the preceding section, we will now calculate the $\chi$-distribution for
10 degrees of freedom. This distribution has high contact at either terminal and, although it
is known to start at $x = \chi = 0$, we shall treat it as a distribution of double infinite range, i.e.
we shall not make direct use of the information that f(x) = 0 for x < 0, and choose a truncated
range a < x < b.

We have a mean of $\mu'_1 = 3.08432776$, and the moments about the mean are given by†
$\mu_2 = 0.4869223$, $\mu_3 = 0.0806720$, $\mu_4 = 0.7132999$, $\mu_5 = 0.3866784$, $\mu_6 = 1.8104865$.
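The quoted moments can be reproduced from the ratios of gamma functions. The following is a minimal sketch using only the standard library; it assumes the usual formula for the moments of $\chi$ about the origin, $2^{r/2}\,\Gamma(\tfrac12(\nu + r))/\Gamma(\tfrac12\nu)$ with $\nu = 10$, and the variable names are illustrative.

```python
from math import comb, gamma

nu = 10

# Moments about the origin of chi with nu degrees of freedom (assumed standard formula).
raw = [2.0**(r / 2) * gamma((nu + r) / 2) / gamma(nu / 2) for r in range(7)]
mean = raw[1]

# Moments about the mean by the binomial formula.
central = [sum(comb(r, k) * (-mean)**(r - k) * raw[k] for k in range(r + 1))
           for r in range(7)]

print(f"mean = {mean:.8f}")
for r in range(2, 7):
    print(f"mu_{r} = {central[r]:.7f}")
```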

* A formula for S(r, h) when the range is infinite will be given in the second paper.

† These follow from the formulae for the moments about the origin which are ratios of $\Gamma$-functions
(see, for example, Kendall, 1945, p. 65). Note that we have used $\mu'$ and $\mu$ for moments about the
origin and the mean, respectively.
The standard deviation is $\sigma = 0.7$, and with seven group intervals available to cover
the essential range we should choose h of the order of the standard deviation.* Our first
attempt is, therefore, (a) h = 0.8.

(a) If we make the mean of x the centre point of the innermost interval we have for the
truncated range $a = \mu'_1 - 3.5 \times 0.8 = \mu'_1 - 2.8$ and $b = \mu'_1 + 2.8$. For the standard variate X,
the origin X = 0 will coincide with the mean of x and its range will be -3.5 < X < +3.5.
Calculation of the moments ($M_r$) of X and substitution in the matrix (22) yields the following
answers for the frequencies $f_i$:

$f_1 = 0.0005$, $f_2 = 0.03325$, $f_3 = 0.26266$, $f_4 = 0.42471$,
$f_5 = 0.23196$, $f_6 = 0.04206$, $f_7 = 0.00487$.

The calculated frequency ($f_7$) for the interval $\mu'_1 + 2.0 < x \le \mu'_1 + 2.8$ is about 0.005, and its
contribution to $\mu_6$ about $0.005\,(2.4)^6 \approx 1$. Since this is an appreciable proportion of $\mu_6$ it is
unlikely that the frequencies beyond $b = \mu'_1 + 2.8$ when substituted in (27) can be neglected,
i.e. b and h are too small.†

(b) Choosing therefore a larger h, we try h = 1. If we still keep the mean in the centre of
the truncated range we have $a = \mu'_1 - 3.5 = -0.42$ and $b = \mu'_1 + 3.5 = 6.58$ (we know, of
course, that f(x) = 0 for x < 0 so that our $f_1$ will really be the frequency for the interval
$0 \le x \le 0.58$). This time the standard variate is $X = x - \mu'_1$, so that $M_r = \mu_r$, and the above
values can be substituted directly in the matrix (22), yielding the comparison of calculated
$\chi$-integral and 'exact' shown in Table 2.


Table 2. Comparison of calculated and exact values of the $\chi$-integral

  X = x - mu'_1    P(x) exact    P(x) calculated    Difference x 10^5

      -2.5          0.00006         0.00011               -5
      -1.5          0.00929         0.00893               36
      -0.5          0.24466         0.24475               -9
      +0.5          0.76767         0.76785              -18
      +1.5          0.97902         0.97888               14
      +2.5          0.99945         0.99947               -2
      +3.5          1.00000         1.00000                0


The maximum error is about 0.0004 and, again, the terminal $f_i$ have a higher decimal
accuracy. In practice, of course, the exact distribution would not be available for
comparison. This time the terminal value $f_7$ is about 0.0005 and represents the frequency for
the interval $\mu'_1 + 2.5 < x \le \mu'_1 + 3.5$. Its contribution to $\mu_6$ is about 0.4, thereby confirming
that the previous grid of group intervals was too fine. To obtain further confirmation on
the tail of the distribution, we determine a third set of $f_i$ by shifting the grid of group intervals
by 0.5 to the right, retaining the interval h = 1. This will make $a = \mu'_1 - 3$ and $b = \mu'_1 + 4$,
i.e. $0.08 \le x \le 7.08$. For our standard variate X the origin will now coincide with $\mu'_1 + 0.5$.

* An unsuitable choice of h would, later, fail to satisfy the checks of internal consistency.

† Comparison with the exact $\chi$-distribution shows that the maximum error in the above $f_i$ is
nevertheless not more than 0.006.
The values of the $M_r$ are as follows:

$M_1 = -0.5$, $M_2 = 0.7369223$, $M_3 = -0.7747114$,
$M_4 = +1.3448394$, $M_5 = -1.8347943$, $M_6 = 3.5957606$.

Substituting these in the matrix (22) we obtain the following values of $f_i$:

$f_1 = 0.00016$, $f_2 = 0.07012$, $f_3 = 0.44430$, $f_4 = 0.40421$,
$f_5 = 0.07699$, $f_6 = 0.00419$, $f_7 = 0.00004$.

The comparison of the progressive sums of the above $f_i$ with the exact $\chi$-integral is of similar
accuracy to that in Table 2. The terminal frequency for $\mu'_1 + 3 \le x \le \mu'_1 + 4$ is 0.0006 with
a contribution of about 0.03 to $\mu_6$, indicating that we have now reached a satisfactory
choice of b.

As a final check on the internal consistency we compare the answers obtained with the
two last choices of group intervals by merging the tables of P(x) to obtain one table at
interval 0.5. This is set out in Table 3. The differences provide a fair check on the internal
consistency to about 3-decimal accuracy of the two separate tables. If a more reliable check
is desired, three or even four separate tables may be computed, all at the same group interval
h and merged in the above manner to form a single table at interval $\tfrac13 h$ or $\tfrac14 h$. This procedure
has the added advantage that interpolation difficulties at the wide interval of h are being
avoided.

Table 3. $x = \chi$ for 10 degrees of freedom. Calculated table of P(x) obtained
from two separate grids of group intervals (h = 1)

  X = x - mu'_1     P(x)     First diff.   Second diff.   Third diff.
                                (differences in units of the 4th decimal)

      -2.5         0.0001
                                   1
      -2.0         0.0002                       86
                                  87                           442
      -1.5         0.0089                      528
                                 615                           601
      -1.0         0.0704                     1129
                                1744                          -174
      -0.5         0.2448                      955
                                2699                         -1122
       0           0.5147                     -167
                                2532                          -855
       0.5         0.7679                    -1022
                                1510                           112
       1.0         0.9189                     -910
                                 600                           480
       1.5         0.9789                     -430
                                 170                           296
       2.0         0.9959                     -134
                                  36                           103
       2.5         0.9995                      -31
                                   5
       3.0         1.0000
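The internal check by merging two grids can be imitated as follows. This is a hedged sketch, assuming NumPy, the standard gamma-function formula for the moments of $\chi$ with 10 degrees of freedom, and Sheppard's correction in the assumed form C(r, 1) = sum over j of binom(r, 2j) (1/4)^j M_(r-2j) / (2j + 1); the function name `calculated_table` is hypothetical. It solves the Vandermonde system directly for each grid, so the values may differ slightly in the last place from those printed.

```python
import numpy as np
from math import comb, gamma

nu, R = 10, 6
raw = [2.0**(r / 2) * gamma((nu + r) / 2) / gamma(nu / 2) for r in range(R + 1)]
mean = raw[1]

def calculated_table(origin):
    """P at the upper end points of seven unit intervals centred at origin - 3, ..., origin + 3."""
    # Moments of X = x - origin about X = 0 (h = 1, so no rescaling is needed).
    M = [sum(comb(r, k) * (-origin)**(r - k) * raw[k] for k in range(r + 1))
         for r in range(R + 1)]
    # Sheppard-corrected moments (assumed form of the correction, h = 1).
    Mbar = [sum(comb(r, 2 * j) * 0.25**j / (2 * j + 1) * M[r - 2 * j]
                for j in range(r // 2 + 1)) for r in range(R + 1)]
    centres = np.arange(-3.0, 4.0)
    f = np.linalg.solve(np.vander(centres, increasing=True).T, np.array(Mbar))
    ends = origin + centres + 0.5 - mean      # end points measured from the mean
    return dict(zip(np.round(ends, 6), np.cumsum(f)))

# Grid centred on the mean, and the same grid shifted by 0.5 to the right.
merged = {**calculated_table(mean), **calculated_table(mean + 0.5)}
xs = sorted(merged)
P = np.array([merged[x] for x in xs])
third_differences = np.diff(P, n=3)

for x, value in zip(xs, P):
    print(f"{x:5.1f}  {value:.4f}")
print(np.round(third_differences * 1e4).astype(int))
```

Smoothly running differences in the merged table are what one would look for; an abrupt alternation between neighbouring entries would point to an unsuitable choice of h or b.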





8. The special case of symmetrical distributions; the normal integral

By placing the origin of the standard variate X at the mean of a symmetrical distribution
we obviously have $f_1 = f_{R+1}$, $f_2 = f_R$, etc., i.e. the number of unknowns is halved. On the
other hand, the odd moments contribute the meaningless equations

$$\sum_i f_i\,(X_i^{\,r} + X_{R+2-i}^{\,r}) = \sum_i f_i \times 0 = 0.$$

With the number of unknowns and equations halved and with even moments only retained,
it is necessary to work out a new matrix based on even-order moments only. In
practice the important values of R are R = 4, 6, 8 and 10, and we are giving below the inverse
matrix (for R = 8) having rank 5 (as there are five equations corresponding to
$\mu_0$, $\mu_2$, $\mu_4$, $\mu_6$ and $\mu_8$):


  i          M_0 = 1       M_2          M_4          M_6          M_8

  1  f_1    0.0003441   -0.0017857    0.0012153   -0.0002315    0.0000124
  2  f_2   -0.0039874    0.0208333   -0.0137153    0.0023148   -0.0000868
  3  f_3    0.0224151   -0.1190476    0.0711806   -0.0081019    0.0002480
  4  f_4   -0.0884281    0.5000000   -0.1434028    0.0127315   -0.0003472
  5  f_5    0.5696564   -0.4000000    0.0847222   -0.0067130    0.0001736

                                                                        (29)

Working rule: Each $f_i$ is obtained by forming the sum of five products using the five coefficients in
the ith line and applying them to $M_0, \ldots, M_8$; e.g. $f_1 = 0.0003441 M_0 - \ldots + 0.0000124 M_8$.


As an example we compute the normal integral from its first five even moments, $\mu_0$ to $\mu_8$,
choosing h = 1 and the standard variate X as normal deviate. Substituting, therefore, in the
matrix (29) $M_0 = 1$, $M_2 = 1$, $M_4 = 3$, $M_6 = 15$ and $M_8 = 105$, we obtain the five $f_i$ which in
Table 4 have been progressively added to form the 'calculated normal integral' to be
compared with the 'exact' one. The accuracy is remarkable, the maximum error being 15 in
the 6th decimal.


Table 4. Comparison of calculated normal integral with exact normal integral

    X      Exact P(x)    Calculated P(x)    Difference x 10^6

   -4       0.000032        0.000034               -2
   -3       0.001350        0.001342                8
   -2       0.022750        0.022765              -15
   -1       0.158655        0.158643               12
    0       0.500000        0.500000                0
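The symmetrical case can be retraced numerically. The sketch below is illustrative and assumes NumPy. It also assumes, as an inference from the arguments -4, ..., 0 of Table 4 rather than a statement in the text, that the grid consists of ten unit intervals with centre points ±0.5, ..., ±4.5, so that symmetry leaves five unknown frequencies; Sheppard's correction is taken in the assumed form C(r, 1) = sum over j of binom(r, 2j) (1/2)^(2j) M_(r-2j) / (2j + 1).

```python
import numpy as np
from math import comb, erf, sqrt

# Even moments of the standard normal deviate: M_0, M_2, ..., M_8.
M_even = {0: 1.0, 2: 1.0, 4: 3.0, 6: 15.0, 8: 105.0}
orders = (0, 2, 4, 6, 8)

def corrected(r):
    """Mbar_r = M_r + C(r, 1) for even r (assumed form of Sheppard's correction)."""
    return sum(comb(r, 2 * j) * 0.5**(2 * j) / (2 * j + 1) * M_even[r - 2 * j]
               for j in range(r // 2 + 1))

# Assumed grid: centre points -4.5, ..., -0.5 and their mirror images.
centres = np.array([-4.5, -3.5, -2.5, -1.5, -0.5])
# Each frequency occurs twice (once on either side), hence the factor 2.
A = np.array([[2.0 * c**r for c in centres] for r in orders])
rhs = np.array([corrected(r) for r in orders])

f = np.linalg.solve(A, rhs)
P_calc = np.cumsum(f)                        # P(x) at x = -4, -3, -2, -1, 0
P_exact = np.array([0.5 * (1.0 + erf(x / sqrt(2.0))) for x in (-4, -3, -2, -1, 0)])

for x, c, e in zip((-4, -3, -2, -1, 0), P_calc, P_exact):
    print(f"{x:3d}  calculated {c:.6f}  exact {e:.6f}")
```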


With symmetrical distributions we cannot, of course, shift the grid of group intervals,
as otherwise we would lose the symmetry relation between the $f_i$. If, therefore, intermediate
values of P(x) are required in order to ease subsequent interpolation, we can achieve this only
by altering h. Merging the answers obtained from (say) three different h grids all centred
at x = 0 (e.g. h = 0.9, 1.0 and 1.1), we would not obtain a table of P(x) at an equidistant
interval. In the internal check we would, therefore, use divided differences.


9. Divergent or poorly convergent moments; the
t-distribution for 10 degrees of freedom

Some variates with infinite range have distribution functions with low contact at $x = \infty$,
i.e. the convergence in

$$\lim_{x \to \infty} x^{r} f(x) = 0 \qquad (30)$$

is slow; indeed, in some cases the moment $\mu_r$ is divergent for, say, $r \ge R'$.

As an example we have investigated the t-distribution for 10 degrees of freedom. Here we
have $f(x) = c\,(1 + x^2/10)^{-11/2}$ and hence R' = 10. In this case, therefore, R' is known a priori.

If no such mathematical information is available, warning of low contact is given by the rapid
growth of the moments $\mu_r$ as $r \to R$, provided R is near to R'.* For our example for the
t-distribution we find

$\mu_4 = 6.25$, $\mu_6 = 78.125$, $\mu_8 = 2734.375$.
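The rapid growth of these moments is easily verified. The short sketch below uses the standard expression for the even moments of Student's t with $\nu$ degrees of freedom, $\mu_{2k} = \nu^k \prod_{j=1}^{k} (2j - 1)/(\nu - 2j)$, valid for $2k < \nu$; the function name `t_even_moment` is illustrative only.

```python
from fractions import Fraction

def t_even_moment(k, nu):
    """Even moment of order 2k of Student's t with nu degrees of freedom (needs 2k < nu)."""
    if 2 * k >= nu:
        raise ValueError("moment diverges for 2k >= nu")
    value = Fraction(1)
    for j in range(1, k + 1):
        value *= Fraction(nu * (2 * j - 1), nu - 2 * j)
    return value

moments = {2 * k: float(t_even_moment(k, 10)) for k in range(1, 5)}
normal = {2: 1, 4: 3, 6: 15, 8: 105}   # even moments of the normal curve, for contrast

for order in moments:
    print(order, moments[order], normal[order])
```

The eighth moment is already some twenty-six times its normal counterpart, which is the kind of warning referred to in the text.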

The difficulty with such distributions is that artificial truncation is not justified if the
high-order, poorly convergent moments are to be used in equations (6). The remedy in such
cases is the square variate transformation $y^2 = x$. Sometimes it may be necessary to use a
higher power $y^k = x$. Obviously, if we were to take an equidistant interval for y, the group
interval for x will grow with the square law, thereby absorbing the slowly convergent tail
end of f(x).

Now, obviously, the moments of y are simply related to those of x; we have

$$\int_0^\infty x^r f(x)\,dx = 2\int_0^\infty y^{2r}\,y\,f(y^2)\,dy, \qquad (31)$$

or introducing the new distribution function $g(y) = 2y f(y^2)$, we have

$$\int_0^\infty x^r f(x)\,dx = \int_0^\infty y^{2r} g(y)\,dy. \qquad (32)$$


Applying now the previous method to g(y) it is further necessary to avoid using the poorly
convergent high-order moments. In the case of the t-distribution, instead of taking
r = 0, 2, 4, 6 and 8, we take the absolute moments† for r = 0, 1, 2, 3 and 4, which, according
to (32), correspond to the even moments of g(y). If only even moments about the origin are
used in the determination of the $f_i$, the matrix (29) gives the appropriate inversion. Using
h = 0.6 for the y-group interval we substitute in (29):

$M_0 = 1$, $M_2 = 2.401906$, $M_4 = 9.645062$, $M_6 = 52.952032$ and $M_8 = 372.108863$.

We thereby obtain five values of $f_i$ (i = 1, ..., 5) of the form

$$f_i = \int_{(5-i)h}^{(6-i)h} g(y)\,dy = \int_{(5-i)^2 h^2}^{(6-i)^2 h^2} f(x)\,dx. \qquad (33)$$

The progressive sums of these are compared with the corresponding values of the exact
t-integral in Table 5. Although the accuracy is lower than in the previous example it is
satisfactory and very much better than we could have obtained without applying the
transformation $y^2 = x$.


Table 5. Comparison of calculated and exact values of the t-integral

    t       P(t) exact    P(t) calculated    Difference x 10^4

   5.76       0.0001          0.0001                 0
   3.24       0.0044          0.0042                 2
   1.44       0.0902          0.0905                -3
   0.36       0.3613          0.3648               -35
   0.00       0.5000          0.5000                 0


* If R is much smaller than R', the present difficulty will not arise at all.

† We shall show in a second paper that, if the absolute moments of a distribution are not known, they
can be obtained by interpolation between the values of $\log \mu_r$ for r = 2, 4, 6, 8, etc.; in fact, we shall give
a general discussion of the interpolability of the logarithmic moment function for positive x.
10. Lack of high contact at the start of the variate x = a

We confine ourselves here to the most important case of lack of high contact at one terminal,
say the start of the distribution, and assume, therefore, that there is high contact at the other end
of the range.

Without loss of generality we assume that a = 0, i.e. $x \ge 0$, and introduce the new variate
$y = x^{1/k}$, $k \ge 2$. Whence we have

$$\int_0^b x^r f(x)\,dx = \int_0^{b^{1/k}} y^{kr} g(y)\,dy, \qquad (34)$$

where $g(y) = k\,y^{k-1} f(y^k)$. Obviously g(y) has, at least, contact of order k - 1 at the start y = 0;
further, if f(x) has contact of order m at x = b, i.e. if $f(x) = O((b - x)^m)$ at x = b, then
$g(y) = O((b^{1/k} - y)^m)$. Hence there is high-order contact, of order k - 1 and m
respectively, at both ends of the range. The previous method is therefore applicable to g(y)
provided we can obtain its moments from those of f(x). It is obvious from (34) that in order
to obtain the ordinary moments of g(y) we require to know the 'fractional' moments of f(x),
i.e. those corresponding to r = j/k (j = 0, 1, ...). If the moments of f(x) are only known for
integer r the fractional moments will have to be obtained by interpolation of the logarithmic
moment function $\log \mu_r$, which will be more fully discussed in the next paper.
APPROXIMATION TO PERCENTAGE POINTS OF
THE z-DISTRIBUTION


By A. H. CARTER, King's College, Cambridge


Tables have been published of the values of z for various percentage levels (20, 5, 1 and 0.1 %)
for a range of given $n_1$, $n_2$ (Fisher & Yates, 1943, Table V). When $n_1$ or $n_2$ is outside the range
of the tables, recourse must be had to approximate formulae (unless, of course, interpolation
is sufficiently accurate) which will combine accuracy with facility of computation. One
such formula, due to Fisher, with a modification suggested by Cochran (1940), is given at
the foot of the above-mentioned tables. The purpose of this paper is to derive an alternative
formula, no more difficult to compute, which will be shown to give consistently closer
approximations to the true value of z for all except small $n_1$ or $n_2$.

Wishart (1947) has derived formulae for the exact cumulants of z, and also the well-known
approximations to them when $n_1$, $n_2$ are large. The exact cumulants as far as $\kappa_4$ can be readily
obtained arithmetically from tables of the Polygamma functions. Knowing the cumulants
of the distribution, we may make use of the Cornish-Fisher normalization function method,
based on Edgeworth's form of the Gram-Charlier type A series (Cornish & Fisher, 1937), to
approximate to the percentage points. The method consists in writing z as an expansion
in powers of $\xi$, a corresponding normal variate, the coefficients being functions of the
cumulants of z, and assumes that $\kappa_r$ is of order $n^{1-r}$, which is true for the z-distribution
(Wishart, 1947, p. 172).

If z and $\xi$ are expressed in standard measure (i.e. mean zero, standard deviation unity)
we then derive

$$\frac{z - \mu'_1}{\sigma} = \xi + \frac{\kappa_3}{\sigma^3}\,\frac{\xi^2 - 1}{6} + \frac{\kappa_4}{\sigma^4}\,\frac{\xi^3 - 3\xi}{24} - \frac{\kappa_3^2}{\sigma^6}\,\frac{2\xi^3 - 5\xi}{36},$$

correct to order $n^{-1}$. This gives

Formula (a):

$$z = \mu'_1 + \sigma\xi + \frac{\kappa_3}{\sigma^2}\,\frac{\xi^2 - 1}{6} + \frac{\kappa_4}{\sigma^3}\,\frac{\xi^3 - 3\xi}{24} - \frac{\kappa_3^2}{\sigma^5}\,\frac{2\xi^3 - 5\xi}{36}, \qquad (1)$$

correct to order $n^{-3/2}$ (since $\sigma = O(n^{-1/2})$), where $\mu'_1\,(= \kappa_1)$, $\sigma^2\,(= \kappa_2)$, $\kappa_3$, $\kappa_4$ are cumulants of
the z-distribution. The $\xi$-coefficients may be readily computed; e.g. for the 5 % level,
substitute $\xi = 1.64485$. Table 2 gives the values, for the 20, 5, 1 and 0.1 % levels, of the coefficients
required in applying the formula. The quantities $\mu'_1$, $\sigma$, $\kappa_3$, $\kappa_4$ depend of course on $n_1$ and $n_2$,
and may be evaluated in any particular case, whence substitution in (1) gives the appropriate
value of z. Since $|z_{1-P}(n_1, n_2)| = |z_P(n_2, n_1)|$, where $z_P$ is the value of z corresponding to
probability P, to find the percentage points for the 'negative tail', i.e. 80, 95, 99 and 99.9 %,
we may simply interchange $n_1$ and $n_2$. This has the effect of changing the sign of the odd
cumulants, so that in (1) we write $-\mu'_1$ and $-\kappa_3$ for $\mu'_1$ and $\kappa_3$.

Formula (a), being an approximation to order $n^{-3/2}$, may be expected to give reliable results
when $n_1$ and $n_2$ are both large. For the 1 % point, for example, we find z(6, 12) = 0.7843
(true value 0.7864), whereas z(24, 60) = 0.3744 (true value 0.3746). Some further results for
(6, 12), (6, 60), and (24, 60) are shown in Table 1 (a).

In practice, some labour is involved in applying formula (a), even if polygamma tables
are available. The Fisher-Cochran formula, derived by the normalization function method,
is a simple working approximation, valid for large $n_1$, $n_2$, in which the exact cumulants are
replaced by their approximations in terms of inverse powers of $n_1$ and $n_2$.
Table 1. Comparison of approximations to the percentage points of z

Formula (b): Existing formula (Fisher-Cochran).
Formula (c): New formula.

[Table body not recoverable. For each percentage level the rows gave Formula (b), Formula (c) and
True z; the surviving column headings $(n_1, n_2)$ are 6, 60; 24, 60; 20, 36; 20, 100; 36, 60 in the
first half and 12, 6; 60, 6; 60, 24; 36, 20; 100, 20; 60, 36 in the second half.]


Table 1 (a). Some values of z from formula (a) (exact cumulant formula)

(For corresponding true values, see Table 1)

[Table body not recoverable.]
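The cumulant function of z is

$$K(t) = \tfrac12 it \log\frac{n_2}{n_1} + \log\Gamma\!\left(\frac{n_1 + it}{2}\right) + \log\Gamma\!\left(\frac{n_2 - it}{2}\right) - \log\Gamma\!\left(\frac{n_1}{2}\right) - \log\Gamma\!\left(\frac{n_2}{2}\right),$$

and the cumulants are obtainable by differentiating this successively with respect to (it),
at each stage putting t = 0.

Since

$$\left[\frac{d}{d(it)} \log\Gamma\!\left(\frac{n \pm it}{2}\right)\right]_{t=0} = \pm\tfrac12\,\frac{d}{d(\tfrac12 n)} \log\Gamma(\tfrac12 n),$$

and $\log\Gamma(\tfrac12 n)$ can be expanded by Stirling's theorem in inverse powers of n, the cumulants
may also readily be expressed in inverse powers of $n_1$, $n_2$ and, when $n_1$, $n_2$ are reasonably
large, the first few terms only in the expansions will give sufficiently close approximations
to them. In fact, writing

$$s = \frac{1}{n_1} + \frac{1}{n_2}, \qquad d = \frac{1}{n_1} - \frac{1}{n_2},$$

it has been shown that

$$\begin{aligned}
\kappa_1 &= -\tfrac12 d - \tfrac16 sd + O(n^{-4}),\\
\kappa_2 &= \tfrac12 s + \tfrac14 (s^2 + d^2) + O(n^{-3}),\\
\kappa_3 &= -\tfrac12 sd + O(n^{-3}),\\
\kappa_4 &= \tfrac14 s\,(s^2 + 3d^2) + O(n^{-4}),\\
\kappa_r &= O(n^{1-r}) \qquad (r > 1).
\end{aligned} \qquad (A)$$

Formula (a) will now have an extra term, since we take as our 'working variance' of z
not its exact value, but its approximation to order $n^{-1}$ from (A), i.e. $\tfrac12 s$. In the notation of
Kendall (1945), $l_2 = \kappa_2/\tfrac12 s - 1 \approx \tfrac12 (s + d^2/s)$.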

We then obtain

$$z = \mu'_1 + \sqrt{\tfrac12 s}\,\Bigl[\xi + \frac{s}{24}(\xi^3 + 3\xi) + \frac{d^2}{72s}(\xi^3 + 11\xi)\Bigr] - \frac{d}{6}(\xi^2 - 1), \qquad (2)$$

or

$$z = \mu'_1 + \frac{\xi}{\sqrt{h - \lambda}} - \frac{d}{6}(\xi^2 - 1), \qquad (3)$$

where

$$h = 2/s, \qquad \lambda = \tfrac16 (\xi^2 + 3),$$

provided $\tfrac{1}{144}\,d^2 \sqrt{2/s}\,(\xi^3 + 11\xi)$ may be neglected (which will be the case for small d).

Inserting the approximation to $\mu'_1$ from (A), i.e. $-\tfrac12 d$, we have

$$z = \frac{\xi}{\sqrt{h - \lambda}} - \frac{d}{6}(\xi^2 + 2),$$

the Fisher-Cochran formula, which has, in fact, been found to give a fairly close
approximation to the true z for $n_1$, $n_2$ both reasonably large. It may be noted that if $n_1$, $n_2$ are not very
large, an improvement will be effected by including the second term in the estimate of the
mean ($\kappa_1$), i.e. from (A) by adding $-\tfrac16 sd$.

For $(n_1, n_2) = (6, 12)$ this correction is -0.00347, and for (24, 60), the correction is
-0.00024. Inserting this improved approximation to $\mu'_1$ in (3) we have

Formula (b):

$$z = \frac{\xi}{\sqrt{h - \lambda}} - \frac{d}{6}(\xi^2 + 2 + s). \qquad (3b)$$
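A numerical illustration of formula (b) may be helpful. The sketch below is not part of the original derivation: it assumes that formula (b) reads $z = \xi/\sqrt{h - \lambda} - \tfrac16 d\,(\xi^2 + 2 + s)$ with $s = 1/n_1 + 1/n_2$, $d = 1/n_1 - 1/n_2$, $h = 2/s$ and $\lambda = \tfrac16(\xi^2 + 3)$, and the function name `fisher_cochran_b` is hypothetical. The normal deviate is taken from the Python standard library.

```python
from math import sqrt
from statistics import NormalDist

def fisher_cochran_b(xi, n1, n2):
    """Approximate percentage point of z from formula (b), as read above (an assumption)."""
    s = 1.0 / n1 + 1.0 / n2
    d = 1.0 / n1 - 1.0 / n2
    h = 2.0 / s
    lam = (xi**2 + 3.0) / 6.0
    return xi / sqrt(h - lam) - d / 6.0 * (xi**2 + 2.0 + s)

xi_1pc = NormalDist().inv_cdf(0.99)          # normal deviate for the 1 % level
z_24_60 = fisher_cochran_b(xi_1pc, 24, 60)
z_6_12 = fisher_cochran_b(xi_1pc, 6, 12)
print(round(z_24_60, 4), round(z_6_12, 4))
```

For (24, 60) the result may be set against the true value 0.3746 quoted earlier for the 1 % point; for (6, 12), where the degrees of freedom are small, a noticeably larger departure from the true value 0.7864 is to be expected.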
As pointed out by Wishart (1947, p. 179), an approximation to the value of any $\kappa_r$ (r > 1)
obtained by considering its leading term only, will be improved by writing $1/(n_1 - 1)$ for $1/n_1$ and
$1/(n_2 - 1)$ for $1/n_2$, since, by Stirling's expansion of a factorial,

” ^'°8l-l + i>og2>r+i + 0(»^), 

and so on. 

Table 2. $\xi$-coefficients required in applying formula (a)

                            20 %        5 %         1 %        0.1 %

  xi                       0.84162     1.64485     2.32635     3.09023
  (xi^2 - 1)/6            -0.04861     0.28426     0.73532     1.42492
  (xi^3 - 3 xi)/24        -0.08036    -0.02018     0.23379     0.84332
  (2 xi^3 - 5 xi)/36      -0.08377     0.01878     0.37634     1.21026

Thus writing

$$s' = \frac{1}{n_1 - 1} + \frac{1}{n_2 - 1}, \qquad d' = \frac{1}{n_1 - 1} - \frac{1}{n_2 - 1}, \tag{B}$$

we might expect a better approximation to z to be obtained, corresponding to that of Fisher
and Cochran, if we use s' and d' instead of s and d.

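Corresponding to equations (A) we have

$$\kappa_2 = \tfrac{1}{2}s' - \tfrac{1}{24}s'(s'^2 + 3d'^2) + O(n^{-5}),$$

$$\kappa_3 = -\tfrac{1}{2}s'd' + O(n^{-4}),$$

$$\kappa_4 = \tfrac{1}{4}s'(s'^2 + 3d'^2) + O(n^{-5}).$$

For the mean, however,

$$\mu_1' = \kappa_1 = -\tfrac{1}{2}d - \tfrac{1}{6}sd + O(n^{-4}), \tag{4}$$

$$\phantom{\mu_1' = \kappa_1} = -\tfrac{1}{2}d' + \tfrac{1}{3}s'd' + O(n^{-3}). \tag{4a}$$

If $n^{-3}$ is not negligible (relative to the degree of accuracy desired) $\mu_1'$ should therefore be left
in the form $-\tfrac{1}{2}d - \tfrac{1}{6}sd$.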

Proceeding as before, we obtain

$$z - \mu_1' = \sqrt{\tfrac{1}{2}s'}\,\Bigl\{\xi + \frac{s'}{24}(\xi^3 - 3\xi) + \frac{d'^2}{72s'}(\xi^3 - 7\xi)\Bigr\} - \frac{d'}{6}(\xi^2 - 1), \tag{5}$$

whence

$$z = \mu_1' + \frac{\xi}{\sqrt{h' - \lambda'}} - \frac{d'}{6}(\xi^2 - 1), \tag{6}$$

where $h' = 2/s'$, $\lambda' = \tfrac{1}{6}(\xi^2 - 3)$.

Since this is based in the first place on more accurate approximations to the cumulants
$\kappa_2$ and $\kappa_4$, and since the term omitted from (5) in deriving (6) (i.e. $\tfrac{1}{72}\{d'^2/\sqrt{(2s')}\}(\xi^3 - 7\xi)$)
is evidently numerically less than the corresponding term omitted in obtaining the Fisher-Cochran
formula (i.e. $\tfrac{1}{72}\{d^2/\sqrt{(2s)}\}(\xi^3 + 11\xi)$), formula (6) might be expected to give an
improved approximation to z. In fact, however, it does not, and the reason is not
far to seek.

Consider the expansion of $1/\sqrt{(h - \lambda)}$ in both cases:

$$\frac{1}{\sqrt{h - \lambda}} = \frac{1}{\sqrt{h}}\Bigl\{1 + \frac{\lambda}{2h} + \frac{3\lambda^2}{8h^2} + \dots\Bigr\},$$

where the terms are decreasing in magnitude (since $1/h = \tfrac{1}{2}s = O(n^{-1})$). Hence the error
in neglecting all terms after the second will be approximately of the order of the third term.

Now in the Fisher-Cochran approximation this term, $+\dfrac{3\lambda^2}{8h^2}\dfrac{\xi}{\sqrt{h}}$, has the same sign as the
omitted term $\tfrac{1}{72}\{d^2/\sqrt{(2s)}\}(\xi^3 + 11\xi)$ (both being of the same sign as $\xi$), so that the extra terms
included will tend to compensate for the term omitted. In obtaining (6) from (5), on the
other hand, the term omitted, $\tfrac{1}{72}\{d'^2/\sqrt{(2s')}\}(\xi^3 - 7\xi)$, will be of opposite sign to $\xi$ when
$|\xi| < \sqrt{7}$, corresponding to a probability of about 0.004; so that for most percentage levels
encountered in practice, the error in (5) is increased in (6).

A better formula is obtained from (5) as

$$z = \mu_1' + \frac{\xi\sqrt{h' + \lambda'}}{h'} - \frac{d'}{6}(\xi^2 - 1), \tag{7}$$

where $h'$ and $\lambda'$ are as in (6).

Expanding $\sqrt{(h' + \lambda')}/h'$ it is found that the third term is now of opposite sign to $\xi$ and
hence the extra terms contained in the expansion will tend to compensate for the term
omitted. Since s' and d' require to be calculated in applying this formula, it is desirable to
write $\mu_1'$ in the form (4a) (provided we can neglect quantities of order $n^{-3}$). This gives

Formula (c):

$$z = \frac{\xi\sqrt{h' + \lambda'}}{h'} - \frac{d'}{6}(\xi^2 + 2 - 2s'). \tag{7a}$$


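Collecting the results, we have the three approximate formulae:

Formula (a) (exact cumulant method):

$$z = \kappa_1 + \sqrt{\kappa_2}\,\Bigl\{\xi + \tfrac{1}{6}\gamma_1(\xi^2 - 1) + \tfrac{1}{24}\gamma_2(\xi^3 - 3\xi) - \tfrac{1}{36}\gamma_1^2(2\xi^3 - 5\xi)\Bigr\},$$

where $\gamma_1 = \kappa_3/\kappa_2^{3/2}$, $\gamma_2 = \kappa_4/\kappa_2^2$.

Formula (b) (Fisher-Cochran formula):

$$z = \frac{\xi}{\sqrt{h - \lambda}} - \frac{d}{6}(\xi^2 + 2 + s),$$

where

$$s = \frac{1}{n_1} + \frac{1}{n_2}, \quad d = \frac{1}{n_1} - \frac{1}{n_2}, \quad h = \frac{2}{s}, \quad \lambda = \frac{\xi^2 + 3}{6}.$$

Formula (c) (new formula):

$$z = \frac{\xi\sqrt{h' + \lambda'}}{h'} - \frac{d'}{6}(\xi^2 + 2 - 2s'),$$

where

$$s' = \frac{1}{n_1 - 1} + \frac{1}{n_2 - 1}, \quad d' = \frac{1}{n_1 - 1} - \frac{1}{n_2 - 1}, \quad h' = \frac{2}{s'}, \quad \lambda' = \frac{\xi^2 - 3}{6}.$$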
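
As a check on the arithmetic, formulae (b) and (c) can be evaluated numerically. The sketch below assumes the forms $z = \xi/\sqrt{(h-\lambda)} - \tfrac{1}{6}d(\xi^2+2+s)$ and $z = \xi\sqrt{(h'+\lambda')}/h' - \tfrac{1}{6}d'(\xi^2+2-2s')$ with $s, d$ built from $1/n$ and $s', d'$ from $1/(n-1)$; the function names are hypothetical and $\xi$ is taken as the upper-tail normal deviate.

```python
from math import sqrt
from statistics import NormalDist


def z_fisher_cochran(n1, n2, upper_tail_prob):
    """Formula (b): Fisher-Cochran approximation with the improved mean."""
    xi = NormalDist().inv_cdf(1.0 - upper_tail_prob)
    s = 1.0 / n1 + 1.0 / n2
    d = 1.0 / n1 - 1.0 / n2
    h = 2.0 / s
    lam = (xi * xi + 3.0) / 6.0
    return xi / sqrt(h - lam) - d / 6.0 * (xi * xi + 2.0 + s)


def z_new_formula(n1, n2, upper_tail_prob):
    """Formula (c): the same quantities built on 1/(n - 1) instead of 1/n."""
    xi = NormalDist().inv_cdf(1.0 - upper_tail_prob)
    s = 1.0 / (n1 - 1) + 1.0 / (n2 - 1)
    d = 1.0 / (n1 - 1) - 1.0 / (n2 - 1)
    h = 2.0 / s
    lam = (xi * xi - 3.0) / 6.0
    return xi * sqrt(h + lam) / h - d / 6.0 * (xi * xi + 2.0 - 2.0 * s)


if __name__ == "__main__":
    for n1, n2 in ((24, 60), (20, 100)):
        print(n1, n2, z_fisher_cochran(n1, n2, 0.05), z_new_formula(n1, n2, 0.05))
```

For $(n_1, n_2) = (24, 60)$ at the 5% level both functions should fall within 0.001 of the tabulated z of about 0.2653.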




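When $n_1$, $n_2$ are large (and not too different), $\tfrac{1}{6}sd$ is negligible and formula (b) becomes the
formula more generally quoted

$$z = \frac{\xi}{\sqrt{h - \lambda}} - \Bigl(\frac{1}{n_1} - \frac{1}{n_2}\Bigr)\frac{\xi^2 + 2}{6},$$

which may be written

$$z = \frac{\xi}{\sqrt{h - \lambda}} - \Bigl(\frac{1}{n_1} - \frac{1}{n_2}\Bigr)\bigl(\lambda - \tfrac{1}{6}\bigr).$$

Similarly, for sufficiently large $n_1$, $n_2$, $\tfrac{1}{3}s'd'$ may be neglected and formula (c) becomes

$$z = \frac{\xi\sqrt{h' + \lambda'}}{h'} - \Bigl(\frac{1}{n_1 - 1} - \frac{1}{n_2 - 1}\Bigr)\frac{\xi^2 + 2}{6}
   = \frac{\xi\sqrt{h' + \lambda'}}{h'} - \Bigl(\frac{1}{n_1 - 1} - \frac{1}{n_2 - 1}\Bigr)\bigl(\lambda' + \tfrac{5}{6}\bigr).$$

It is to be noted, however, that since $\tfrac{1}{3}s'd'$ is approximately twice $\tfrac{1}{6}sd$, more care must be
exercised in deciding to neglect it. For example, when $(n_1, n_2)$ is (20, 100), $\tfrac{1}{3}s'd' = 0.0009$,
and for (24, 60), its value is 0.0005.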

For purposes of comparison, values of z have been computed from formulae (b) and (c),
for the four common percentage levels, over a fairly wide range of $n_1$, $n_2$. They are shown in
Table 1, together with the corresponding true values of z. The latter were obtained where
possible from the tables of Fisher and Yates: elsewhere by inverse interpolation in Tables of
the Incomplete Beta-Function followed by a logarithmic transformation. Such values are in
error by not more than 0.0001. It will be seen that neither formula yields very accurate
results when $n_1$ or $n_2$ is as small as 6, though even here the new formula is rather better with
the single exception of $n_1 = 12$, $n_2 = 6$. In actual practice, however, we are concerned with
large values of $n_1$, $n_2$ beyond the range of the published tables. Considering only those cases
where $n_1$ and $n_2$ are both greater than 20, it is seen that formula (c) gives a consistently closer
approximation than does formula (b) for both the positive and the negative tails, and for
all the percentage levels investigated, though its relative gain in accuracy is greatest at the
1 and 0.1% levels. It may be noted, in fact, that in no case considered having $n_1$ and $n_2$
greater than 20, is the error more than 9 in the fourth decimal place, i.e. it appears
that for all except small $n_1$, $n_2$ this formula will give an approximation to z correct to
within 0.001.

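In conclusion, therefore, it is recommended that formula (c) be adopted for general use,
since it is no more difficult to compute, and is more accurate, than the existing formula.
Dropping the dashes we have the formula

$$z = \frac{\xi\sqrt{h + \lambda}}{h} - \Bigl(\frac{1}{n_1 - 1} - \frac{1}{n_2 - 1}\Bigr)\Bigl(\lambda + \frac{5}{6} - \frac{s}{3}\Bigr),$$

where

$$s = \frac{2}{h} = \frac{1}{n_1 - 1} + \frac{1}{n_2 - 1}, \qquad \lambda = \frac{\xi^2 - 3}{6}.$$

The values of $\xi$ and $\lambda$ for the four percentage levels are:

               20%        5%         1%         0.1%

$\xi$          0.8416     1.6449     2.3263     3.0902
$\lambda$     -0.3819    -0.0491     0.4020     1.0916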
Note on the cumulants of Fisher's z-distribution


By LEO A. AROIAN, Hunter College

In a recent article Dr J. Wishart (1947) stated: 'Explicit expressions for the exact cumulants of Fisher's
z-distribution do not appear ever to have been published.' Fisher's z-distribution and the related
Snedecor's F-distribution formed a part of my doctor's thesis and rather full results concerning the cumulants
of the z-distribution and other properties of the distribution were published in the Annals of
Mathematical Statistics (Aroian, 1941) some time ago.* I should like to take this opportunity of adding certain
comments on the Gram-Charlier Type A approximation to the z-distribution and the type III
approximation to the F-distribution.

To obtain the cumulants of the z-distribution I expanded the moment generating function $M_z(\theta)$ in
powers of $\theta$ and found the kth semi-invariant (or cumulant) of z as the coefficient of $\theta^k/k!$. The exact
results correspond with Wishart's formulae (9) to (16), although given in a different form, and need not
be repeated here. In addition, asymptotic formulae for $\lambda_{k:z}$, $n_1$ and $n_2$ large, were derived by means of
the Euler-Maclaurin sum formula. Furthermore, another type of formula could have been given for
$n_1$ small but $n_2$ large, merely by expanding that part of $\lambda_{k:z}$ in which $n_2$ occurs by the Euler-Maclaurin
sum formula. The special cases for the logarithmic t, the logarithmic $\chi^2$ and the logarithmic normal
probability functions follow by substituting the proper limiting values of $n_1$ and $n_2$.

In my previous paper I was overcautious concerning the type A approximation to the z-distribution.
Actually the method is fairly accurate although tedious. Taking

$$t = (z - \lambda_{1:z})/\sigma_z,$$

we have

$$\int_{t_0}^{\infty} F(t)\,dt = \eta,$$

$$\int_{t_0}^{\infty} F(t)\,dt \simeq \int_{t_0}^{\infty} \phi(t)\,dt + \phi(t_0)\Bigl\{\frac{\alpha_{3:z}}{6}(t_0^2 - 1) + \frac{\alpha_{4:z} - 3}{24}(t_0^3 - 3t_0)\Bigr\},$$

where $\eta$ is usually 0.10, 0.05, 0.025, 0.01, etc. As an example take $n_1 = 24$, $n_2 = 60$; then

$$\lambda_{1:z} = -0.0127429, \quad \sigma_z = 0.173779, \quad \lambda_{3:z} = -0.0007998, \quad \lambda_{4:z} = 0.0000867,$$

$$\frac{\alpha_{3:z}}{6} = -0.02540, \qquad \frac{\alpha_{4:z} - 3}{24} = 0.00396.$$

$t_0$ for $\eta = 0.05$ is 1.60094, $z_{0.05} = 0.26547$ against the accurate value of 0.26534. For the 1% point
$t_0 = 2.2338$, $z_{0.01} = 0.3754$ against the accurate value of 0.3746. When $n_1 = n_2 = 24$, $z_{0.05} = 0.3423$
against the accurate value of 0.3425.

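The type III approximation to the F-distribution is of some interest since for $n_1$ moderate and $n_2$
large, $n_1F$ tends to be distributed as $\chi^2$ with $n_1$ degrees of freedom. Since

$$\text{Mean } F = \bar{F} = \frac{n_2}{n_2 - 2},$$

$$\sigma_F = \bar{F}\sqrt{\frac{2(n_1 + n_2 - 2)}{n_1(n_2 - 4)}},$$

$$\alpha_{3:F} = \frac{4(2n_1 + n_2 - 2)}{n_1(n_2 - 6)}\sqrt{\frac{n_1(n_2 - 4)}{2(n_1 + n_2 - 2)}},$$

we find the 5, 1 or 0.1% points for F by using

$$F_{0.05} = \bar{F} + \sigma_F(1.64486 + 0.28392\,\alpha_{3:F} - 0.04902\,\alpha_{3:F}^2),$$

$$F_{0.01} = \bar{F} + \sigma_F(2.32636 + 0.73330\,\alpha_{3:F} - 0.024967\,\alpha_{3:F}^2),$$

$$F_{0.001} = \bar{F} + \sigma_F(3.09034 + 1.4190\,\alpha_{3:F} + 0.05667\,\alpha_{3:F}^2).$$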

* [Both Dr Wishart, as author, and myself as editor regret that owing to wartime preoccupation the
publication of Dr Aroian's 1941 paper was overlooked. E.S.P.]

Biometrika 34 
These formulae for the levels of significance of the F distribution are from a previous paper (Aroian,
1943). For $n_1 = 24$, $n_2 = 60$, $F_{0.05}$ by this approximation is 1.709 compared with the accurate value
of 1.700. For $n_1 = 24$, $n_2 = 100$, $F_{0.05}$ by this approximation is 1.631 against the accurate value of 1.627.
For $n_1 = n_2 = 100$, $F_{0.05}$ by this approximation is 1.394 as compared with the accurate value of 1.392.
While these results are not too poor, they are not so accurate as the well-known formulae of
Cochran-Fisher or of E. Paulson (1942) which, for large values of $n_1$ and $n_2$, generally give 4 significant figures.
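
The three worked values quoted above can be reproduced with a few lines of code. This is a sketch only: it assumes the mean, standard deviation and skewness of F take the forms $\bar{F} = n_2/(n_2-2)$, $\sigma_F = \bar{F}\sqrt{2(n_1+n_2-2)/\{n_1(n_2-4)\}}$ and $\alpha_{3:F} = \{4(2n_1+n_2-2)/[n_1(n_2-6)]\}\sqrt{n_1(n_2-4)/\{2(n_1+n_2-2)\}}$, and the name `type3_f_point` is hypothetical.

```python
from math import sqrt

# Coefficients (constant, linear, quadratic in the skewness) as printed
# in the text for the 5%, 1% and 0.1% points.
COEFFS = {
    0.05: (1.64486, 0.28392, -0.04902),
    0.01: (2.32636, 0.73330, -0.024967),
    0.001: (3.09034, 1.4190, 0.05667),
}


def type3_f_point(n1, n2, level):
    """Approximate upper percentage point of F (requires n2 > 6)."""
    mean_f = n2 / (n2 - 2.0)
    sigma_f = mean_f * sqrt(2.0 * (n1 + n2 - 2.0) / (n1 * (n2 - 4.0)))
    alpha3 = (4.0 * (2.0 * n1 + n2 - 2.0) / (n1 * (n2 - 6.0))
              * sqrt(n1 * (n2 - 4.0) / (2.0 * (n1 + n2 - 2.0))))
    c0, c1, c2 = COEFFS[level]
    return mean_f + sigma_f * (c0 + c1 * alpha3 + c2 * alpha3 ** 2)


if __name__ == "__main__":
    for n1, n2 in ((24, 60), (24, 100), (100, 100)):
        print(n1, n2, round(type3_f_point(n1, n2, 0.05), 3))
```

Under these assumptions the three 5% points come out close to the quoted 1.709, 1.631 and 1.394.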

REFERENCES

Aroian, L. A. (1941). Ann. Math. Statist. 12, 420.

Aroian, L. A. (1943). Ann. Math. Statist. 14, 93.

Paulson, E. (1942). Ann. Math. Statist. 13, 233.

Wishart, J. (1947). Biometrika, 34, 170.


A note on the mean deviation from the median 


By K. R. NAIR


For samples drawn from a normal universe, Godwin (1945) obtained the sampling distribution of the
mean deviation when the individual deviations are measured from the sample mean. It is well known that
the mean deviation is least when it is measured from the sample median.* Let us refer to them as 'mean
deviation from mean' and 'mean deviation from median' respectively, and use the letters m and m' to
denote their sample estimates.

The exact sampling distribution of m being now known and its probability integral tabulated, the
question may well be asked what the distribution of m' is. Since $m' \le m$ their expectations have the
same relationship

$$E(m') \le E(m). \tag{1}$$

For samples of n from a normal population with standard deviation $\sigma$, let $E(m') = f_n'\sigma$ and $E(m) = f_n\sigma$,
where $f_n' \le f_n$. For getting unbiased estimates of $\sigma$ we should divide m' by $f_n'$ and m by $f_n$. What we are
now interested to know is which of the two estimates has a smaller standard error. In the case of m, it
has been shown by Helmert (1876) and Fisher (1920) that

$$f_n = \sqrt{\frac{2(n - 1)}{n\pi}} \tag{2}$$

and

$$\text{S.E. of } (m/f_n) = \frac{\sigma}{\sqrt{n}}\Bigl\{\tfrac{1}{2}\pi + \sqrt{n(n - 2)} - n + \sin^{-1}\frac{1}{n - 1}\Bigr\}^{1/2}. \tag{3}$$

In the case of m', we neither know $f_n'$ nor the standard error of $(m'/f_n')$ for samples of size n.

It is obvious that when n is very large, the mean and median will differ very little from one another
and hence $m' \to m$ and $f_n' \to f_n$. It is interesting to note that, at the other end of the scale, namely, when
n = 2, m and m' are identical, and equal to one-half the sample range.

To discover any real difference that may exist between the standard errors of $(m/f_n)$ and $(m'/f_n')$,
which is the same as determining the difference between the coefficients of variation of m and m', we must
consider samples of size greater than 2.

(i) Let us take n = 3, and let $x_1, x_2, x_3$ be the observed values arranged in order of ascending magnitude.
We at once find that

$$m' = \tfrac{1}{3}(x_3 - x_1) = \tfrac{1}{3}w. \tag{4}$$

The distribution of m' for samples of 3 is therefore derivable from that of the range. The probability
integral of the range has been tabulated by Pearson & Hartley (1942) for n = 2 to 20. For our purpose
it is necessary only to know the values of the mean range ($\bar{w}$) and the standard error of the range ($\sigma_w$)
for samples of 3. This can be calculated, correct to six decimal places, from certain numerical values
given by Pearson (1926). Using his figures,

$$\bar{w} = 1.692568 \times \sigma, \tag{5}$$

$$\sigma_w = 0.888368 \times \sigma. \tag{6}$$

The value of $f_n'$ for samples of 3 is, therefore,

$$f_3' = \tfrac{1}{3} \times 1.692568 = 0.56419,$$

and the standard error of $m'/f_3'$ is

$$\frac{0.888368}{1.692568}\,\sigma = 0.52486\sigma, \tag{7}$$

correct to five decimal places.

* When n, the sample size, is an odd number, the sample median is by definition the value of the
$\tfrac{1}{2}(n + 1)$th ranked observation. When n is even, the sample median is conventionally taken as the mean
of the $\tfrac{1}{2}n$th and $\tfrac{1}{2}(n + 2)$th ranked values. The mean deviation from the median will have the same
magnitude whatever value, between the $\tfrac{1}{2}n$th and $\tfrac{1}{2}(n + 2)$th ranked values, the median takes, when
n is even. No complication is therefore ...

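The corresponding values for $f_3$ and standard error of $(m/f_3)$ are obtained by putting n = 3 in
equations (2) and (3) and are

$$f_3 = \sqrt{\frac{4}{3\pi}} = 0.65147, \tag{8}$$

and

$$\text{S.E. of } (m/f_3) = \sigma\bigl\{\tfrac{1}{3}\bigl(\tfrac{2}{3}\pi + \sqrt{3} - 3\bigr)\bigr\}^{1/2} = 0.52486\sigma, \tag{9}$$

correct to five decimal places.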
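
The figures for the mean deviation from the mean can be checked numerically. The sketch below assumes the Helmert-Fisher results in the form $f_n = \sqrt{2(n-1)/(n\pi)}$ and S.E. of $(m/f_n) = \sigma\{[\tfrac{1}{2}\pi + \sqrt{n(n-2)} - n + \sin^{-1}(1/(n-1))]/n\}^{1/2}$; the function names are hypothetical and $\sigma$ is taken as 1.

```python
from math import asin, pi, sqrt


def f_n(n):
    """E(m)/sigma for the mean deviation from the sample mean."""
    return sqrt(2.0 * (n - 1) / (n * pi))


def se_of_scaled_m(n):
    """Standard error of m/f_n in units of sigma (coefficient of variation of m)."""
    bracket = pi / 2.0 + sqrt(n * (n - 2)) - n + asin(1.0 / (n - 1))
    return sqrt(bracket / n)


if __name__ == "__main__":
    for n in (3, 4):
        print(n, round(f_n(n), 6), round(se_of_scaled_m(n), 6))
```

For n = 3 this gives values agreeing with (8) and (9) to five decimal places.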

Although (9) can be evaluated to any number of decimal places, we are not in a position to bring (7) 
to a higher order of accuracy than five decimal places. It is very unlikely that (7) and (9) are absolutely 
identical, but we may safely conclude that they are practically the same. 

(ii) We next come to samples of 4. If $x_1, x_2, x_3, x_4$ be the observations arranged in order of ascending
magnitude, the mean deviation from median is given by

$$m' = \tfrac{1}{4}(x_4 + x_3 - x_2 - x_1). \tag{10}$$

The distribution of m' follows immediately from 'some order statistic distributions for samples of
size 4' obtained by Walsh (1946) and is as follows (taking $\sigma = 1$):

$$p(m')\,dm' = 6\sqrt{\frac{2}{\pi}}\,e^{-2m'^2}\Bigl\{\sqrt{\frac{2}{\pi}}\int_0^{2m'} e^{-\frac{1}{2}y^2}\,dy\Bigr\}^2 dm'. \tag{11}$$

The probability integral of m' is given by

$$P(m') = \int_0^{m'} p(m')\,dm' = \Bigl\{\sqrt{\frac{2}{\pi}}\int_0^{2m'} e^{-\frac{1}{2}y^2}\,dy\Bigr\}^3. \tag{12}$$

The values of P(m') given by (12) can easily be evaluated using the normal probability integral table
and are given in cols. (3) and (6) of the table below, alongside corresponding values (given in cols. (2)
and (5)) for the probability integral of the mean deviation (m) from the mean, for samples of 4, copied
from Godwin's (1945) tables.

Table giving the probability integral of the mean deviation from (a) mean and (b) median
for samples of four observations from a normal universe ($\sigma = 1$)

m (or m')    P(m)       P(m')      m (or m')    P(m)       P(m')

0.0          0.00000    0.00000    1.3          0.96758    0.97229
0.1          0.00333    0.00398    1.4          0.98229    0.98475
0.2          0.02634    0.03003    1.5          0.99073    0.99192
0.3          0.07879    0.09204    1.6          0.99534    0.99588
0.4          0.16693    0.19139    1.7          0.99776    0.99798
0.5          0.28345    0.31818    1.8          0.99896    0.99905
0.6          0.41652    0.45629    1.9          0.99953    0.99957
0.7          0.54836    0.58951    2.0          0.99980    0.99981
0.8          0.66934    0.70592    2.1          0.99992    0.99992
0.9          0.77040    0.79954    2.2          0.99997    0.99997
1.0          0.84860    0.86962    2.3          0.99999    0.99999
1.1          0.90602    0.91888    2.4          1.00000    1.00000
1.2          0.94321    0.95162
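
The P(m') column can be regenerated from the normal probability integral. The sketch below reads (12) as the cube of the normal integral taken from 0 to 2m' (an assumption, though one that reproduces the tabulated entries); the function name is hypothetical.

```python
from statistics import NormalDist


def prob_integral_median_md(m_prime, sigma=1.0):
    """P(m') for samples of four, read as {2*Phi(2 m'/sigma) - 1} cubed."""
    g = 2.0 * NormalDist().cdf(2.0 * m_prime / sigma) - 1.0
    return g ** 3


if __name__ == "__main__":
    for tenths in range(0, 25):
        m = tenths / 10.0
        print("%.1f  %.5f" % (m, prob_integral_median_md(m)))
```

Printing the 25 values from 0.0 to 2.4 gives a column to set beside the tabulated P(m').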
We note that although m and m' have an infinite range from 0 to $\infty$, their probability integrals rapidly
approach unity, this value being reached to five decimal place accuracy when m (m') = 2.4$\sigma$. We can
approximately work out the moments of the two distributions from the table above. The values of the
mean and the standard deviation (applying Sheppard's correction for grouping) of m and m' so obtained
are:

Mean:                       $\bar{m} = 0.690986\sigma$,         $\bar{m}' = 0.663187\sigma$,
Standard deviation:         $\sigma_m = 0.297015\sigma$,        $\sigma_{m'} = 0.292979\sigma$,          (13)
Coefficient of variation:   $\sigma_m/\bar{m} = 0.429842$,      $\sigma_{m'}/\bar{m}' = 0.441776$.

The values of $\bar{m}$ and $\sigma_m/\bar{m}$ obtained from the exact formulae (2) and (3) are

$$\bar{m} = \sigma\sqrt{\frac{3}{2\pi}} = 0.690988\sigma, \qquad
\frac{\sigma_m}{\bar{m}} = \tfrac{1}{2}\bigl\{\tfrac{1}{2}\pi + 2\sqrt{2} - 4 + \sin^{-1}\tfrac{1}{3}\bigr\}^{1/2} = 0.429842, \tag{14}$$

showing close agreement with the values given in (13) for the mean and coefficient of variation of m.
We may therefore consider the mean and coefficient of variation of m', approximately evaluated in (13),
to be of sufficient accuracy to warrant the conclusion that, for samples of size 4, the mean deviation from
the mean leads to a more 'efficient' estimate of the population standard deviation than the mean
deviation from the median. As the distribution of the latter is not known for n > 4, we are not in a position to
say whether this conclusion holds good, in general, for all values of n.

In conclusion, it seems worth making the following point:

(a) if expressions for the expectation and variance of m' were available and tables of its probability
integral worked out,

(b) if the efficiency of the m' estimate compared to the m estimate for n > 4 was not appreciably worse
than for the case n = 4,

there would be strong practical grounds for using m' rather than m in view of greater simplicity in
calculation. In both cases we must first arrange the observations in order of magnitude. Then if
$x_1, x_2, \dots, x_n$ are the ordered values, m' may be calculated from the formula

$$m' = \frac{1}{n}\bigl\{(x_{n-i+1} + x_{n-i+2} + \dots + x_n) - (x_1 + x_2 + \dots + x_i)\bigr\}, \tag{15}$$

where $i = \tfrac{1}{2}n$ or $\tfrac{1}{2}(n - 1)$ according as n is even or odd.

For m, however, we must also calculate the arithmetic mean $\bar{x}$ and look for $x_k$ and $x_{k+1}$ between which
$\bar{x}$ lies. Then m can be obtained from one of the three formulae

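$$\frac{nm}{2k(n - k)} = \frac{1}{n - k}\Bigl\{\bar{x} - \frac{x_1 + x_2 + \dots + x_k}{k}\Bigr\}
 = \frac{1}{k}\Bigl\{\frac{x_{k+1} + \dots + x_n}{n - k} - \bar{x}\Bigr\}
 = \frac{1}{n}\Bigl\{\frac{x_{k+1} + \dots + x_n}{n - k} - \frac{x_1 + \dots + x_k}{k}\Bigr\}. \tag{16}$$

This certainly involves a rather longer process.

It is interesting to note that m' becomes a special case of the measure of dispersion based on difference
between the sums of the first and the last r observations (in order of magnitude) suggested by Jones
(1946), the range becoming another special case of the same measure, when r = 1.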
On the method of paired comparisons


By P. A. P. MORAN, Institute of Statistics, Oxford University


M. G. Kendall & B. Babington Smith (1940) have discussed the 'method of paired comparisons' for
investigating preferences. Suppose we are given n objects A, B, ..., K, and an observer is asked to choose
between every pair. If A is preferred to B we write $A \to B$. If the observer is not completely consistent,
either because of his own inefficiency or because the objects are not really capable of being ranked in
respect of the quality under consideration, he may make preferences of the type $A \to B \to C \to A$, and we
call this an inconsistent or circular triad. Write d for the number of circular triads in a given experiment.
Then Kendall & Babington Smith show that

$$\zeta = 1 - \frac{24d}{n^3 - n} \quad (n \text{ odd})$$

$$\phantom{\zeta} = 1 - \frac{24d}{n^3 - 4n} \quad (n \text{ even})$$

may be regarded as a 'coefficient of consistence' and lies between 0 and 1, being capable of attaining
both these limits.

Now suppose that each comparison is made at random so that there are equal chances that $A \to B$
and $B \to A$. The distribution of d is then of interest. They calculate this distribution exactly for
n = 2, ..., 7 and conjecture that its moments are given by

$$\mu_1' = \frac{1}{4}\binom{n}{3}, \qquad \mu_2 = \frac{3}{16}\binom{n}{3}, \qquad \mu_3 = -\frac{3}{32}\binom{n}{3}(n - 4),$$

$$\mu_4 = \frac{3}{256}\binom{n}{3}\Bigl\{9\binom{n-3}{3} + 39\binom{n-3}{2} + 9\binom{n-3}{1} + 7\Bigr\},$$

these being polynomials in n which agree with their numerical calculations for n = 2, ..., 7. They also
conjecture that the distribution tends to normality when n increases. In the present note we prove
these statements.

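Let the objects be numbered from 1 to n. Write $P_{ijk} = 1$ if the triad (i, j, k) is circular, and $P_{ijk} = 0$
if it is not. Then $d = \Sigma P_{ijk}$, the sum being taken over all such triads. Now by enumerating the various
cases we see that $E(P_{ijk}) = \tfrac{1}{4}$ and so $\mu_1'(d) = E(\Sigma P_{ijk}) = \tfrac{1}{4}\binom{n}{3}$. Now consider $\mu_2'(d) = E[(\Sigma P_{ijk})^2]$.
Consider the types of terms which result when we expand this. In the first place we have $\binom{n}{3}$ terms
typified by $P_{123}^2$, and these contribute $\tfrac{1}{4}\binom{n}{3}$. Similarly, we have terms typified by $P_{123}P_{145}$,
$P_{123}P_{124}$ and $P_{123}P_{456}$ and the numbers of these are respectively $\tfrac{3}{2}\binom{n}{3}(n - 3)(n - 4)$, $3\binom{n}{3}(n - 3)$ and
$\binom{n}{3}\binom{n-3}{3}$, whilst their expectations are each $\tfrac{1}{16}$. It follows that

$$\mu_2'(d) = \frac{1}{16}\binom{n}{3}\Bigl\{\binom{n}{3} + 3\Bigr\}, \qquad \mu_2(d) = \frac{3}{16}\binom{n}{3}.$$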
The calculation of $\mu_3'(d)$ is a good deal more complicated. $\mu_3'(d) = E[(\Sigma P_{ijk})^3]$ and on expanding we
get 16 types of terms, typified by

$$P_{123}^3,\; P_{123}^2P_{145},\; P_{123}^2P_{124},\; P_{123}^2P_{456},\; P_{123}P_{124}P_{125},\; P_{123}P_{124}P_{134},\; P_{123}P_{124}P_{135},\; P_{123}P_{124}P_{345},$$

$$P_{123}P_{124}P_{156},\; P_{123}P_{145}P_{456},\; P_{123}P_{145}P_{167},\; P_{123}P_{145}P_{246},\; P_{123}P_{345}P_{567},\; P_{123}P_{145}P_{678},\; P_{123}P_{124}P_{567},\; P_{123}P_{456}P_{789}.$$

After some calculation we find the sum of the contributions of these to be

$$\mu_3'(d) = \frac{1}{2304}\binom{n}{3}\{n^6 - 6n^5 + 13n^4 + 42n^3 - 158n^2 - 108n + 864\}.$$

Reducing to the mean we get

$$\mu_3(d) = -\frac{3}{32}\binom{n}{3}(n - 4).$$

The calculation of $\mu_4(d)$ is a great deal more complicated, there being 85 terms which are not zero;
we finally obtain, after lengthy calculations,

$$\mu_4'(d) = \frac{1}{55296}\binom{n}{3}\{n^9 - 9n^8 + 33n^7 + 45n^6 - 582n^5 + 504n^4 + 5732n^3 - 10692n^2 - 30024n + 80352\},$$

and so

$$\mu_4(d) = \frac{1}{55296}\binom{n}{3}\{972n^3 + 972n^2 - 36936n + 80352\},$$

which reduces to the conjectured result.
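
For small n the moments can be confirmed by brute force, enumerating every set of random preferences. The following is a minimal sketch, not the method of the note: it assumes the moments take the forms $\mu_1' = \tfrac{1}{4}\binom{n}{3}$, $\mu_2 = \tfrac{3}{16}\binom{n}{3}$, $\mu_3 = -\tfrac{3}{32}\binom{n}{3}(n-4)$ and $\mu_4 = \tfrac{1}{55296}\binom{n}{3}(972n^3 + 972n^2 - 36936n + 80352)$, and the function names are hypothetical. It is practical only for n up to about 6.

```python
from fractions import Fraction
from itertools import combinations, product
from math import comb


def circular_triad_moments(n):
    """Exact mean and central moments (2nd to 4th) of d by full enumeration."""
    pairs = list(combinations(range(n), 2))
    triads = list(combinations(range(n), 3))
    counts = []
    for bits in product((0, 1), repeat=len(pairs)):
        beats = dict(zip(pairs, bits))  # 1 means the lower index is preferred
        d = 0
        for i, j, k in triads:
            a, b, c = beats[(i, j)], beats[(j, k)], beats[(i, k)]
            # circular: i->j, j->k, k->i, or the reverse cycle
            if (a, b, c) in ((1, 1, 0), (0, 0, 1)):
                d += 1
        counts.append(d)
    total = len(counts)
    mean = Fraction(sum(counts), total)

    def central(r):
        return sum((Fraction(c) - mean) ** r for c in counts) / total

    return mean, central(2), central(3), central(4)


def formula_moments(n):
    """The closed forms assumed above, as exact fractions."""
    c = comb(n, 3)
    mu4 = Fraction(c * (972 * n**3 + 972 * n**2 - 36936 * n + 80352), 55296)
    return (Fraction(c, 4), Fraction(3 * c, 16),
            Fraction(-3 * c * (n - 4), 32), mu4)


if __name__ == "__main__":
    for n in (3, 4, 5):
        print(n, circular_triad_moments(n) == formula_moments(n))
```

Exact rational arithmetic is used so that agreement is an equality rather than a floating-point comparison.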

We now prove that the distribution tends to normality. To do this, it is sufficient (Kendall, 1943,
p. 110) to prove that

$$\frac{\mu_{2m}}{\mu_2^m} \to \frac{(2m)!}{2^m m!}, \qquad \frac{\mu_{2m+1}}{\mu_2^{\frac{1}{2}(2m+1)}} \to 0, \quad \text{for } m = 1, 2, \dots.$$

Consider the second of these first. Write $Q_{ijk} = P_{ijk} - \tfrac{1}{4}$. Then

$$\mu_{2m+1} = E[(\Sigma Q_{ijk})^{2m+1}].$$

It is clear that for any given m we could calculate $\mu_{2m+1}$, given sufficient labour, by expanding this
and considering the expectation of each type of term and calculating the number of times it occurs, which
will be a polynomial in n. Now consider the various types of terms in the expansion of $(\Sigma Q_{ijk})^{2m+1}$. We
classify these terms according to whether the Q's have common suffixes. Let $Q_{ijk}Q_{lmn} \dots Q_{pqr}$ be a
typical product in the expansion. If this can be separated into p groups of products of Q's such that
different groups have no common suffixes whilst within each group the triads are connected to each other
by having common points, we shall say such a product 'contains p groups'. Moreover, the number of
times such a term occurs will be a polynomial in n whose order is equal to the number of distinct suffixes
occurring in the product. If in a group a suffix only appears once, the inconsistency of the triad containing
it is unaffected by the remainder of the group and the expectation of the product of Q's in that group will
be zero. It follows that in all those terms which contribute something non-zero to $\mu_{2m+1}$, none of
the groups can contain a suffix which appears only once. Therefore, since all terms which contain more
than m groups will have at least one group consisting of a single Q, the expectation of such terms will
be zero. It follows that $\mu_{2m+1}$ is a polynomial in n, of degree 3m + 1 (= integral part of $\tfrac{3}{2}(2m + 1)$) at
most, whose coefficients depend on m only. But $\mu_2(d)$ is of degree 3 in n and so

$$\frac{\mu_{2m+1}}{\mu_2^{\frac{1}{2}(2m+1)}} \to 0.$$


Now consider $\mu_{2m}$. This is a polynomial in n and our aim is to find the order and coefficient of the term of
largest order. In the first place we need only consider terms with m or less groups, for if a term has more
than m groups, one at least will consist of a single Q and the expectation of the term will be zero.
Moreover, as before, in each term, the suffixes in each group must each occur at least twice in that group. The
number of times each type of term occurs will be a polynomial in n of order equal to the total number of
distinct suffixes in that term. As we shall show the leading term in $\mu_{2m}$ to be of order 3m, we can neglect
terms whose frequency is less than this and therefore we can neglect all terms in which a suffix appears
more than twice. Now consider a term with fewer than m groups and therefore containing a group of
order greater than two in the Q's. As no suffix can occur more than twice, no Q can occur more than once.
Consider any Q, $Q_{ijk}$ say, of this group. Then either the suffixes i, j, k are common to three other triads or
one, i, say, is common to another triad and j, k common to a third. In either case evaluation of the
expectation shows it to be zero. We can therefore restrict our attention to the case where there are m
groups each containing two triads. Such groups can only be of the form $Q_{123}^2$, $Q_{123}Q_{124}$ and $Q_{123}Q_{145}$ and the
expectations of the two latter are zero whilst the expectation of $Q_{123}^2$ is $\tfrac{3}{16}$.

The number of groups is m and the number of ways of choosing m such distinct pairs out of 2m
is $\dfrac{(2m)!}{2^m m!}$, so that the leading term in $\mu_{2m}$ is

$$\frac{(2m)!}{2^m m!}\Bigl(\frac{3}{16}\Bigr)^m\Bigl(\frac{n^3}{6}\Bigr)^m = \frac{(2m)!}{2^m m!}\Bigl(\frac{n^3}{32}\Bigr)^m,$$

whilst the leading term in $\mu_2^m$ is $(n^3/32)^m$ and so

$$\frac{\mu_{2m}}{\mu_2^m} \to \frac{(2m)!}{2^m m!}.$$

The distribution therefore tends to normality.
2”^m! * 
Notes on the calculation of autocorrelations of linear
autoregressive schemes


By M. H. QUENOUILLE


1. Bartlett (1946) has recently shown how, for a series of observations, we can test whether the
observations can be adequately represented by a linear autoregressive scheme

$$u_{n+j} + a_1u_{n+j-1} + \dots + a_ju_n = \epsilon_{n+j}, \tag{1}$$

where the $a_i$ are known or fitted values, and $\epsilon_{n+j}$ is an error component independent of $u_{n+j-1}, u_{n+j-2}, \dots$. Bartlett's

test is based on the formula ^ ^ 

oov {r„ ~ -- S 

where r, is the estimate of the true autocorrelation between and 

00 

The purpose of the note is to demonstrate how, using generating functions, $\rho_i$ and $\sum \rho_i\rho_{i+t}$ can
be calculated with the minimum of computation.

2. The method of generating functions seems to have been used by Wold (1938), who applied them to
finding the variances and covariances of linear forms of finite extent in variables such as $\epsilon_n$. We shall,
however, be concerned with linear forms of infinite extent.

It can easily be shown that the solution of ( 1 ) can be written 

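For example, if $u_{n+2} + au_{n+1} + bu_n = \epsilon_{n+2}$,

we have

$$(1 + at + bt^2)^{-1} = (1 - 2x\cos\theta + x^2)^{-1} = 1 + \frac{\sin 2\theta}{\sin\theta}x + \frac{\sin 3\theta}{\sin\theta}x^2 + \dots,$$

where

$$\cos\theta = -\tfrac{1}{2}a/\sqrt{b}, \qquad x = t\sqrt{b},$$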

bt Y\ 

* sin0 ,^(4b-a2) 


and hence 
3. Using this generating function, we have

S /)(«'= + + + + IJ • (4) 

Now the expansion of (4) can he achieved by splitting into partial fractions and, in general, we can let 
0-^1 before this operation is performed. Thus 

u4.q 4*'^i ^ "i" • * • 4” 1^"”^ i3^ *■)”*T • 4* 

(14-%^ +... 4-a^^^) + *♦* 4-a^) 14-ai^4* ••• 4*a^ ’ ^ ^ 

and using pf == —p^{, we can see that 

{^^0) 

= Bi-afB^ja^ {i==^ 

Thus the autocorrelations will be generated by 

i Bi-\-B^t-{‘BjP^\ _1 4 -+... 

Aq 14* 4*. 4.4"%^^ A^t l4'Gtii^'“^4’***4*^^^~'^ 

where the first term is expanded in powers of t and the second term is expanded in powers of 


4. The expression (6) can now be squared to give a generating function for 2 pipi^t* It will be 
necessary to split ...ArBs) 

(14" % ^ 4“ • ■ ■ 4' 4“ 4* ‘ ♦ 4“ 

into partial fractions, but the labour will be reduced since the matrix of the ooefficients of the equations 
in will be unaltered. 


5. To illustrate the method, we can consider Kendall's series 1, which was used by Bartlett in his
example.

The autoregressive scheme for this series is

$$u_{n+2} - 1.1u_{n+1} + 0.5u_n = \epsilon_{n+2},$$

so that


£ Pit^^ 

-isss— 00 


(2 

(1 - l-U + 0-5fi) + 


2Bg + (B^ + 2■2Bl)t B, + Bit 


where 


Thus 



2*1164’" 

1*442,3 


“0" 

1 


0-2 2*8846, 


“ . , 0*7333*-0*6t 1 0*7333 

S == 14*^-__ 

/Oi-l‘lp^^l4-0*5p;„4 s: 0 (^>0). 

If we now consider the square of the expression (7) we have a product term 

2i(0-7333-0-B()(0-7333e-0 -B) -2B, + (B,-h2-2B,)t Bg+jSji 

(l~l‘lt + 0-Bt^)(t^~hlt+0-5) l-l-K+O-Bt^ '''(’‘-l-U+O-B’ 

rBi"! = r-3-7692 2-11641 T-0-73331 

LbJ L-2-1154 -1-4423Jl 1-5764] 

0-66861 

-O-7210J’ 



and 
go that 


(V 

( 8 ) 


where 
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and, if we write 


■P«= S PiPH, S Pi 


-S S , „ 0-7333-0-6« 

■(wa~aj 00 1— l‘lt-f0*6i® 


p-7333-0-6{y 


, , 0-6686-0-7210J 

+ 1.4420+t-~~^^-^_ +terma in 

= 2-4420+ 1 ^ p. 0-5377-0-7333i-f 0-26«“ 


l-l-l<+0-6<2 

+ terms in 


(l-l-li + 0-6«®) 


(9) 


From this we have $\sum_{-\infty}^{\infty}\rho_i^2 = 2.4420$ and the 'correlations' $P_t$ of the correlations are 0.8334, 0.4321,
0.0006, .... Successive terms may be calculated using the relation

$$P_t - 2.2P_{t-1} + 2.21P_{t-2} - 1.1P_{t-3} + 0.25P_{t-4} = 0 \quad (t > 0). \tag{10}$$

The calculation of the sum suggested by Bartlett can also be made by this method, but it is more
arduous, and the first few terms will give a good approximation.
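
The numerical results for this example can be checked by direct summation rather than by partial fractions. The sketch below is an independent check, not the generating-function method of the note: it assumes the scheme $u_{n+2} - 1.1u_{n+1} + 0.5u_n = \epsilon_{n+2}$, builds $\rho_i$ from the recurrence with $\rho_1 = -a/(1+b)$, and truncates the infinite sums at a hypothetical cut-off of 200 terms. All function names are illustrative.

```python
def autocorrelations(a, b, n_terms):
    """rho_0 .. rho_{n_terms-1} for u_{n+2} + a u_{n+1} + b u_n = eps_{n+2}."""
    rho = [1.0, -a / (1.0 + b)]
    while len(rho) < n_terms:
        rho.append(-a * rho[-1] - b * rho[-2])
    return rho[:n_terms]


def lagged_sum(rho, t):
    """Sum over all integer i of rho_i * rho_{i+t}, using rho_{-i} = rho_i."""
    m = len(rho)
    total = 0.0
    for i in range(-(m - 1), m):
        j = i + t
        if abs(j) < m:
            total += rho[abs(i)] * rho[abs(j)]
    return total


if __name__ == "__main__":
    rho = autocorrelations(-1.1, 0.5, 200)   # truncation point is an assumption
    s0 = lagged_sum(rho, 0)
    print("sum of rho_i^2:", round(s0, 4))
    print("P_1, P_2, P_3:", [round(lagged_sum(rho, t) / s0, 4) for t in (1, 2, 3)])
```

Because the roots of the scheme have modulus about 0.71, the terms die away quickly and the truncation error is far below the four decimal places quoted.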

7. The same method can be used to calculate the appropriate number of degrees of freedom for
testing the correlation between two linear autoregressive schemes.

n 


In general, if = pijcr'^ and r ; 


S 
-1 


then 


varr« 


n n 

S S Pa Pii 

n n ‘ 

S Ph S p'ii 
i=l i«l 


I / n n \ * 

71 , 5 , 


For linear autoregressive schemes, pi j = and thus 

^ + + pn-lPi-l 


varr/v 


' S Pip'th- 

i«“0O 


Thus, provided n is large, r can be tested with $n\big/\sum_{i=-\infty}^{\infty}\rho_i\rho_i'$ degrees of freedom, and the calculation
of $\sum_{i=-\infty}^{\infty}\rho_i\rho_i'$ can be made by the above method.


8. Finally, it is worth noting that, for autoregressive schemes involving m observables, it is possible
to extend this method by the use of parameters to calculate the correlations within and between the
observables, provided that adequate estimates of the coefficients of the equations are available. In
practice, however, the procedure will often be reversed, and estimates of the coefficients of the
autoregressive schemes will be obtained by equating the theoretical and observed correlations.
Approximate formulae for the percentage points of the incomplete beta function
and of the $\chi^2$ distribution

By D. HALTON THOMSON


Valuable 'Tables of percentage points of the Incomplete Beta Function' have been published in
Biometrika (Thompson, 1941a) giving numerical values of percentage points at various probability levels
between P = 0.995 and P = 0.005 for degrees of freedom $\nu_1 = 2q$ and $\nu_2 = 2p$ ranging up to 120, and with
an accuracy of five significant figures. In the same volume, a 'Table of the percentage points of the $\chi^2$
distribution' was also published (Thompson, 1941b) for values at the same probability levels and degrees
of freedom ranging up to $\nu = 100$, and with an accuracy of six significant figures, thus supplementing the
table of that function originally due to R. A. Fisher (Fisher & Yates, 1938).

Cases arise in practice where the tails of the frequency distribution of a large population are of special
interest, thus involving (in the case of the beta function) values of 2p larger than 120, with a small 2q,
or vice versa. Harmonic interpolation between 120 and infinity, however, leads to substantial errors,
as is found when the values of the percentage points x are expressed in terms of their tail values (x or
1 - x < 0.5). This Note shows that close approximations to such extreme values may be determined by
using the $\chi^2$ table as an auxiliary table to extend the Beta Function Tables in conjunction with certain
simple alternative formulae. Comparisons within the range of the published Beta Function Tables are
made indicating the degree of accuracy within that range. The accuracy of these formulae beyond that
range increases rapidly with increasing 2p and decreasing 2q (and vice versa), so that they can be applied
with confidence under such conditions.

The 'normalised' form of the Incomplete Beta Function, in the usual notation, is

$$I_x(p, q) = \frac{\Gamma(p + q)}{\Gamma(p)\Gamma(q)}\int_0^x x^{p-1}(1 - x)^{q-1}\,dx, \tag{1}$$

in which, for a given P, 1 - x(q, p) in the tables denotes the upper percentage point and x(p, q) the lower
percentage point.

It is known that, when p is large and q is small compared with p, this form tends towards the Incomplete
Gamma Function

P = 

r{q)Jo 

where $x(p, q) = e^{-t}$. This in turn may be transformed to the $\chi^2$ distribution by putting $pt = [\chi^2_\nu(P)]/2$.
For a given large p and small q, therefore, the percentage point in terms of $\chi^2$ is given approximately by

$$x(2p, 2q) \simeq \exp\Bigl[-\frac{\chi^2_\nu(P)}{2p}\Bigr], \tag{2}$$

where $2q = \nu$ in the $\chi^2$ table. This expression gives the exact value of x when 2q = 2, but for larger 2q
the error, which is consistently negative, increases rapidly with increasing 2q unless 2p is very large,
much larger than 120. It is, therefore, of limited practical use. The following modifications were in
consequence evolved.


APPROXIMATION A

Consider the constant of integration in the original form (1) which, when expanded, is

$$\frac{(p + q - 1)(p + q - 2)\dots(p + 1)p}{\Gamma(q)}.$$

Let the terms q - 1, q - 2, ..., 1, 0 be averaged; the constant as a first approximation then becomes

$$\frac{\{p + \tfrac{1}{2}(q - 1)\}^q}{\Gamma(q)}.$$

The numerator suggests that a more accurate approximation for x would be obtained by substituting
$p + \tfrac{1}{2}(q - 1)$ in place of p in (2), thus leading to

$$x(2p, 2q) \simeq \exp\Bigl[-\frac{\chi^2_\nu(P)}{2p + q - 1}\Bigr]. \tag{A}$$
A comparison of the approximate values of x obtained from (A) with the exact values in the Beta
Function Tables, for all probability levels between P = 0.995 and P = 0.005, shows that:

(a) The error is consistently positive, but much smaller than the negative error in (2); in other words
the latter is slightly over-corrected.

(b) For a given p/q and varying P, the error is nearly constant; it is smallest at P = 0.995 and increases
gradually in the direction of P = 0.005.

(c) For a given P, the error decreases rapidly with increasing 2p and/or decreasing 2q.

(d) Provided that p/q is larger than 4, the value of x is within 0.5% of the exact tail value; if p/q is
larger than 10, the error is within 0.1% of that value.
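A minimal Python sketch of approximation (A) follows. The function name is illustrative, and the $\chi^2$ percentage point is supplied by the caller from tables; the value 2.15586 used here for ν = 10 at P = 0.995 is taken from standard $\chi^2$ tables and is an assumption of this example rather than a figure quoted in the text.

```python
import math


def approx_a(two_p, two_q, chi2_point):
    """Approximation (A): formula (2) with p replaced by p + (q - 1)/2.

    chi2_point is the chi-square value for nu = 2q at the chosen level P.
    """
    q = two_q / 2.0
    return math.exp(-chi2_point / (two_p + q - 1.0))


# nu = 2q = 10, P = 0.995 (tabulated chi-square point, assumed here)
CHI2_10_P995 = 2.15586
x_a = approx_a(120, 10, CHI2_10_P995)
print(f"A for 2p = 120, 2q = 10: {x_a:.6f}")
```

When 2q = 2 the denominator reduces to 2p, so the function reverts to formula (2), as the derivation requires.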


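Approximation B

The exponent in (A) may be written

$$-\frac{\chi^2(P)}{2p+q-1} = -\frac{\chi^2(P)}{2q}\left[\frac{2q}{2p+q-1}\right].$$

The factor in square brackets is equivalent to the first term in the known expansion of the form

$$\log\frac{2n+1}{2n-1} = \frac{1}{n} + \frac{1}{12n^3} + \frac{1}{80n^5} + \cdots,$$

where n = (2p + q − 1)/(2q), which converges rapidly when n is large; i.e. when 2p is large compared
with 2q. The above exponent may therefore be written

$$-\frac{\chi^2(P)}{2q}\,\log\left(\frac{2p+2q-1}{2p-1}\right),$$

which, when inserted in (A), leads to

$$x(2p, 2q) \approx \left(\frac{2p-1}{2p+2q-1}\right)^{h}, \qquad (B)$$

where $h = [\chi^2(P)]/(2q)$.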

A similar comparison with the Beta Function Tables, for the same range of probability levels, shows
that:

(a) Approximation (B) gives generally more accurate values than (A), except when 2q is very small,
in which case they are nearly identical.

(b) For a given p/q and varying P, the error is negligible in the vicinity of P = 0.25; it increases
negatively in the direction of P = 0.995, and positively in the direction of P = 0.005, the largest errors
occurring at this level.

(c) For a given P, the error decreases rapidly with increasing 2p and/or decreasing 2q.

(d) Provided that (2p)³/(2q)² is larger than about 160, the values of x are within 0.5% of the exact
tail value; this implies that if 2p is larger than about 160, this degree of accuracy is attained even when
p/q is as low as unity. If (2p)³/(2q)² is larger than about 2000, the error is within 0.1% of the exact value,
which implies that if 2p is larger than about 120, this degree of accuracy is attained when p/q is as low as 4.

It will be observed that, when 2q = 2, the formula does not revert exactly to (2), as is required by theory;
but, unless 2p is also quite small, the error in the computed value of x is negligible.
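Approximation (B) can be sketched in the same way. The function name is illustrative, and the $\chi^2$ points (83.8516 for ν = 120 and 0.0100251 for ν = 2, both at P = 0.995) are taken from standard tables as assumptions of this example. The second call shows the small departure from formula (2) at 2q = 2 noted above.

```python
import math


def approx_b(two_p, two_q, chi2_point):
    """Approximation (B): x ~ ((2p - 1) / (2p + 2q - 1)) ** h, h = chi2 / (2q)."""
    h = chi2_point / two_q
    return ((two_p - 1.0) / (two_p + two_q - 1.0)) ** h


# Symmetrical case 2p = 2q = 120 at P = 0.995
x_b = approx_b(120, 120, 83.8516)
print(f"B for 2p = 2q = 120: {x_b:.5f}")

# At 2q = 2 formula (2) is exact; (B) differs from it only slightly
chi2_2 = 0.0100251
departure = approx_b(120, 2, chi2_2) - math.exp(-chi2_2 / 120)
print(f"B minus formula (2) at 2q = 2: {departure:.1e}")
```

The departure at 2q = 2 is a few units in the ninth decimal place, in line with the difference shown against B in the first row of Table 1.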

The expansion of (B) leads to

$$x(2p, 2q) \approx e^{-w - ws\left(1 + \frac{4}{3}s\right) - \cdots},$$

where

$$w = \frac{\chi^2(P)}{2p+2q-1} \quad\text{and}\quad s = \frac{q}{2p+2q-1},$$

thus demonstrating its analogies with Campbell’s formula (C) below.


Adaptation of Campbell’s formula

In a book concerned primarily with quality control, Simon (1941) quotes (without the proof) a formula,
due to Campbell (1923), designed to determine the average number of defectives in a sample of n, starting
from the known average number in an infinite sample. It is a particular application of the general
problem now under consideration, namely, the approximate determination of the percentage points of
the Beta Function, starting from the corresponding known values for the $\chi^2$ form of the Poisson
exponential binomial summation. It is given in the following form:


$$\frac{a(c, n, P) - a(c, \infty, P)}{a(c, \infty, P)} = \lambda n^{-1} + \tfrac{1}{12}\left[14\lambda^2 + (3a+2)\lambda + a\right]n^{-2} + \cdots, \qquad (3)$$

where a(c, n, P) = average number of defectives in which P is the probability of at least c defectives in
a sample of n, a(c, ∞, P) = average number of defectives in an infinite sample, $\lambda = \tfrac{1}{2}(c - a - 1)$, in which
a = a(c, ∞, P). (Simon quotes a = (a, ∞, P), which is an evident misprint.)

If G denotes the value given by the formula, then

a(c, n, P) = a(c, ∞, P) (1 + G),

so that 1 + G is the factor by which the average number of defectives in an infinite sample must be
multiplied to give that in a sample of n.

The change from Campbell’s notation to the more familiar general notation is given by

a(c, n, P) = {1 − x(2p, 2q)} n,    a = a(c, ∞, P) = [$\chi^2(P)$]/2,

where n = p + q − 1 and c = q.

Let u = a/n and r = (c − 1)/(2n);

then λ = n(r − u/2).

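By inserting this notation in (3) and rearranging the terms, the formula leads to

$$x(2p, 2q) \approx 1 - u + \frac{u^2}{2} - \frac{u^3}{6} - r\left(1 + \tfrac{7}{6}r\right)u + \tfrac{11}{12}ru^2 - \frac{ru}{6n},$$

which expression includes the first four terms in the expansion of $e^{-u}$.

Hence, for the determination of the percentage points x(2p, 2q), Campbell’s formula may, in effect,
be re-written as

$$x(2p, 2q) \approx e^{-u} - r\left(1 + \tfrac{7}{6}r\right)u + \tfrac{11}{12}ru^2 - \frac{ru}{6n}, \qquad (C)$$

where

$$u = \frac{\chi^2(P)}{2(p+q-1)} \quad\text{and}\quad r = \frac{q-1}{2(p+q-1)}.$$

For large 2p and small 2q, the last two terms become negligible, in which case it reduces to

$$x(2p, 2q) \approx e^{-u} - r\left(1 + \tfrac{7}{6}r\right)u. \qquad (C')$$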
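The change of notation can be illustrated with a short Python sketch. It rests on an assumption that should be stated plainly: Campbell's series (3) is taken here in the form G = λ/n + [14λ² + (3a + 2)λ + a]/(12n²), with λ = (c − a − 1)/2. This reading reproduces the value tabulated under C in Table 1 for 2p = 120, 2q = 20. The function name and the $\chi^2$ point 7.43384 (ν = 20, P = 0.995, from standard tables) are likewise assumptions of the example.

```python
import math


def campbell_x(two_p, two_q, chi2_point):
    """Percentage point x(2p, 2q) from Campbell's series (3), assumed form.

    Uses n = p + q - 1, c = q, a = chi2(P)/2 and x = 1 - (a/n)(1 + G).
    """
    p, q = two_p / 2.0, two_q / 2.0
    n = p + q - 1.0            # Campbell's sample size
    c = q                      # Campbell's number of defectives
    a = chi2_point / 2.0       # average number in an infinite sample
    lam = 0.5 * (c - a - 1.0)
    # Assumed reading of (3): first two terms of the series in 1/n
    G = lam / n + (14.0 * lam ** 2 + (3.0 * a + 2.0) * lam + a) / (12.0 * n ** 2)
    return 1.0 - (a / n) * (1.0 + G)


x_c = campbell_x(120, 20, 7.43384)
print(f"C for 2p = 120, 2q = 20: {x_c:.6f}")

# With 2q = 2 the series gives 1 - u + u^2/2 - u^3/6, the start of exp(-u)
print(campbell_x(120, 2, 0.0100251), math.exp(-0.0100251 / 120))
```

For 2q = 2 the result is the four-term expansion of e^{-u}, which is why the re-written form replaces those terms by the exponential itself.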


Cochran’s approximation


Cochran (1940), extending a method of Fisher’s (1925), has introduced a useful approximation for
the percentage points of the Incomplete Beta Function, when both p and q are large, his method being
to determine a sufficiently accurate value of z, as used in Fisher’s z-transformation.

If y is the normal deviate at probability level P, then for a given pair of arguments 2p, 2q, the following
are first calculated, using Hartley’s (1941) notation:


$$\lambda = \tfrac{1}{6}(y^2 + 3),$$
y I 

~M-^) 


2p-b2g'^ 

(A-^)(.4-2p) 

pA 


Hence, by Fisher’s transformation,

$$x(2p, 2q) \approx \frac{2p}{2p + 2q\,e^{2z}}. \qquad (D)$$


Comparison of formulae

Table 1 compares the various formulae for upper percentage points at an extreme probability level
(P = 0.995). Table 2 indicates their relative accuracy on a common basis, namely, as a percentage of the
exact value of x or 1 − x, whichever is the smaller, so that the deviations from the exact values, when
x or 1 − x approach zero, are duly emphasized. For intermediate probability levels, the percentages lie
between the tabulated extremes. It will be noted that in the case of approximation B the errors pass
through zero near the mid-range of P; in the cases of A and C the errors are positive for all values of P.

The general conclusions from these tables and other comparisons are that, for a given probability level:

(a) When p/q > about 6, approximations A, B and C have about the same degree of accuracy, so that
the simpler, A or B, have the advantage.

(b) In the range 6 > p/q > 4, there is little to choose between B and C; but B is the simpler.

(c) When p/q < 4 and the distribution approaches symmetry, D gives the best results, provided that
2p and 2q are moderately large, say > 60. It may be, however, that B in this range will be sufficiently
accurate for many purposes; if p/q > 2, the maximum error of x is about 2 units in the third decimal place.


Table 1. Comparison of approximate formulae at a given probability level

P = 0.995

                                    x(2p, 2q)
 2p    2q    A              B              C (Campbell)   D (Cochran)    Exact

120     2    0.999916461    0.999916459    0.999916461    0.999862       0.999916461
             Nil            -0.000000002   Nil            -0.000054

        4    0.9982908      0.9982906      0.9982907      0.997926       0.9982907
             +0.0000001     -0.0000001     Nil            -0.000365

       10    0.982764       0.982755       0.982760       0.982076       0.982759
             +0.000005      -0.000004      +0.000001      -0.000683

       20    0.944002       0.943893       0.943941       0.943366       0.943930
             +0.000072      -0.000037      +0.000011      -0.000564

       30    0.902230       0.901839       0.902000       0.901551       0.901950
             +0.000280      -0.000111      +0.000050      -0.000399

       40    0.86160        0.86070        0.86106        0.86066        0.86093
             +0.00067       -0.00023       +0.00013       -0.00027

       60    0.78782        0.78522        0.78621        0.78558        0.78579
             +0.00203       -0.00057       +0.00042       -0.00021

      120    0.62598        0.61430        0.61855        0.61620        0.61620
             +0.00978       -0.00190       +0.00235       Nil

N.B. The second line of each entry gives the difference between the approximate and exact values.


Table 2. Relative accuracy of approximate formulae

Error of x(2p, 2q) expressed as a percentage of the smaller exact tail value (x or 1 − x < 0.5)

                          A                      B                  C (Campbell)           D (Cochran)
 2p    2q   p/q    0.995  0.500  0.005    0.995  0.500  0.005    0.995  0.500  0.005    0.995  0.500  0.005

120    12    10      *      *      *        *      *      *        *      *      …      -2.8     …      …
       20     6    +0.1   +0.2   +0.2     -0.1     *      …        *      *      …      -1.0     …      …
       40     3    +0.5   +0.6   +0.7     -0.2     *      …        …      …      …      -0.2     *      …
       60     2    +0.9   +1.1   +1.2     -0.3     *    +0.2     +0.2     …      …      -0.1     …      …
      120     1    +2.5   +2.7   +4.4     -0.5     *      …        …      …    +1.7       *      …      …

 30     3    10      *      *      …        *      *      *        *      *      …     -30.0   -1.4   -7.1
        5     6    +0.1     …      …      -0.1   -0.1   +0.1       *      *      …     -11.1   -0.5   -1.8
       10     3    +0.4   +0.5   +0.8       …      …      …        …      …    +1.3     -2.2   -0.1     …
       15     2    +0.8   +1.0   +1.9       …    -0.1   +0.6       …      …    +2.8     -0.7     …      …
       30     1    +2.4   +2.7   +7.1       …    -0.1   +1.8       …    +0.6   +8.2       *      …      …

* Error smaller than ±0.05%.
Wilson-Hilferty approximation for $\chi^2$: adjustment

This formula (Wilson & Hilferty, 1931) for the percentage points of the $\chi^2$ distribution is

$$\chi^2_P \approx \nu\left[1 - \frac{2}{9\nu} + y_P\sqrt{\frac{2}{9\nu}}\right]^3,$$

where ν represents the degrees of freedom, and $y_P$ the standardized normal deviate corresponding to
probability level P. A table has been published in Biometrika (Merrington, 1941), comparing the
approximations derived from this formula with the exact values, at various probability levels between
P = 0.995 and P = 0.005. It shows the remarkable accuracy of the formula, the maximum errors
varying from about ±0.04, when ν = 30, to about ±0.024, when ν = 100.

When these errors were plotted against the exact values on logarithmic paper, it was observed that
for a given probability level, they varied inversely with $\sqrt{\nu}$ very closely. It follows that this square root
relation may be used to adjust the Wilson-Hilferty formula, bringing the values computed therefrom
still nearer to the exact values.

If the difference (at ν = 30) between the Wilson-Hilferty value and the exact value, when multiplied
by $\sqrt{30}$, is treated as a coefficient C (which may be positive or negative), the required adjustment
for any value of ν is given by

Adjustment = $C/\sqrt{\nu}$.

For various probability levels P, the values of C are given in the following table:


p 

0 

P 

G 

0'996 

+ 0-233 


+ 0*039 


+ 0-157 


+ 0-066 

0*975 



+ 0-036 


+ 0*011 





0*010 





-0-227 


-0-013 

— 

— 


A test against the Merrington Table shows that this adjustment leads to values of $\chi^2$ between ν = 30
and ν = 100 at all probability levels with an accuracy of ±0.001, i.e. to four or five significant figures.
Since the Wilson-Hilferty approximation assumes a normal distribution about 1 − 2/(9ν), which tends
to unity as ν increases to infinity, and since the adjustment tends to zero under those conditions, it
follows that the latter may also be safely applied for an indefinitely large ν.
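The adjustment can be sketched in Python with the standard library alone. Three points are assumptions of this sketch and not statements from the text: P is read as the probability of exceeding the $\chi^2$ value (so that P = 0.995 gives a small $\chi^2$), the adjustment is added to the Wilson-Hilferty value, and the function names are illustrative. The coefficient C = +0.233 is the tabulated value for P = 0.995.

```python
import math
from statistics import NormalDist


def wilson_hilferty(nu, P):
    """Wilson-Hilferty value of chi-square exceeded with probability P."""
    y = NormalDist().inv_cdf(1.0 - P)   # normal deviate exceeded with probability P
    k = 2.0 / (9.0 * nu)
    return nu * (1.0 - k + y * math.sqrt(k)) ** 3


def adjusted(nu, P, C):
    """Wilson-Hilferty value plus the adjustment C / sqrt(nu) (sign assumed)."""
    return wilson_hilferty(nu, P) + C / math.sqrt(nu)


C_995 = 0.233  # coefficient tabulated for P = 0.995
for nu in (30, 100):
    print(nu, round(wilson_hilferty(nu, 0.995), 4), round(adjusted(nu, 0.995, C_995), 4))
```

For ν = 30 and ν = 100 the adjusted values can be compared with the tabulated points 13.787 and 67.328; the unadjusted values differ from these by roughly 0.04 and 0.024 respectively, in line with the errors quoted above.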

It should be added that an adjustment on similar principles is not applicable to the Fisher
approximation for $\chi^2$.
A First Course in Mathematical Statistics. By C. E. Weatherburn. Cambridge
University Press. Price 16s.

An outstanding feature of the present statistical time is the number of text-books which are being
written, and each one from a slightly different point of view. It is this which makes statistical theory
interesting to study, for there can be no rigid approach to a subject which is used and expounded by so
many and diverse persons. Professor Weatherburn has taken a rather formal mathematical exposition
of the subject, and mathematical students will find his book both interesting and profitable to read.
Numerical examples are given for the reader to apply the appropriate mathematical technique. It is
possible that these would have been of greater utility if they had contained the material in its crude state,
and had not been streamlined so that the application of the technique is immediately obvious, but
nevertheless many new examples are there.

I am not sure whether this book will be entirely useful to students of other subjects than mathematics.
While the mathematical analysis is undoubtedly clear it is possible that many will not be able to follow
it in detail, and the conclusions of the analysis are not emphasized strongly. We may contrast with this
Fisher’s Statistical Methods for Research Workers, where no analysis is given, but where the relevant
formulae and their interpretation are stated unmistakably and their applications to material in its
crude state set out so that the student may calculate for himself.

Probability theory is the foundation stone on which the whole of statistical theory is built. It is
disappointing therefore to find that it is given somewhat perfunctory treatment in one chapter and the part
it plays in (say) statistical tests of significance is not brought out and emphasized. There is a tendency
nowadays in applying statistical technique to regard the 5% and 1% levels of significance as sacrosanct
and those coming fresh to the subject should learn that custom is the only reason for their choice.

In spite of the criticisms which I make, however, I would recommend this book to students who have
obtained some idea of the aims and objectives of statistical theory, and who are desirous of learning the
development of the mathematical technique as well as its application. Professor Weatherburn’s
mathematical analysis makes pleasant reading and may well throw new light on old methods for those who
have learnt the rudiments of the theory.

F. N. DAVID


Advances in Genetics, Volume 1. New York, N.Y.: Academic Press. 1947.

This is the first number of a new periodical, probably an annual, summarizing recent work in various
fields. Of the nine articles, ranging from 12 to 96 pages, with mean 42.6, s.d. 7.89, and a positively skew
distribution, perhaps the most interesting to European geneticists will be that on the genetics of the
ciliate Protozoa, Paramecium and Euplotes. Here Sonneborn describes work almost entirely done in
America, with very surprising results. Thus Paramecium aurelia consists of at least seven endogamous
varieties, each with two exogamous mating types, which might be called sexes were it not that in
P. bursaria one of the varieties has no less than eight mating types.

Shrode and Lush’s article on the genetics of cattle gives a very condensed account of the large amount
of work which has been done on the inheritance of economically important characters such as milk yield
and growth rate. For example cattle biometricians have used the important concept of ‘heritability’,
meaning the fraction of the variance of a character due to additive genetic differences. Within a herd
this rarely exceeds 30%. More space is devoted to work on the genetics of colour and the like, which is
of far less economic importance, and the review of progeny testing methods is disappointingly brief.
However, the bibliographical references will be useful. Similarly, Atwood’s article on forage crops, though
most valuable as a guide to the literature, does not give a detailed account of any of the biometric work
which has been done on grasses and clovers.

Only two of the papers give data which a biometrician could immediately utilize. These are Gordon’s
account of polymorphism in fish populations, and Spencer’s of mutations in wild Drosophila species,
which unfortunately does not include some valuable recent Italian and Russian work. Gordon’s results
call for the development of methods of estimating gene frequency similar to those used with human blood
groups. Spencer is mainly concerned with results, but these are often given in sufficient detail to interest
biometricians, though no attempt is made to summarize Wright’s fundamental statistical theory.

The other articles will be less attractive to biometricians, though it is of interest to see how statistical
methods are demanded by the mere fact that the genus Crepis, whose evolution is reviewed by Babcock,
includes 196 species, most of which have been examined cytologically, and between which 130 of the
38,220 possible crosses have been made.

The volume will be indispensable to geneticists. Biometricians certainly cannot neglect it.

J. B. S. H.

Mathematical Methods of Statistics. By H. Cramér. Princeton University Press. 1946.
$6.00.

This book was written by Prof. Cramér during the war and has been published first in Sweden and then
by an offset process by the Princeton University Press in the U.S.A. It is a definitive exposition of the
theory of mathematical statistics as it existed in 1940 (about) and it is worth while therefore to consider
its contents in some detail. Prof. Cramér has divided his exposition into three parts; the first part is
purely mathematical. The theory of sets and of such Lebesgue measure as is necessary for the
understanding of the second part is developed first of all. Such a development will be useful for the student
of mathematical statistics coming fresh to the theory of measure in that he receives guidance as to what
are the elements essential for him to understand. Chapters 11 and 12 on matrices, determinants and
quadratic forms and miscellaneous complements do not fit into this general scheme but have obviously
been included here as part of the mathematical equipment necessary for the student. Possibly Chapter 10
on Fourier Integrals would have fitted more naturally into Part II but this is a matter of taste.

Part II begins with a formal development of the theory of probability as given by the French and
Russian schools of probability, and which Prof. Cramér has already given in his Cambridge tract ‘Random
Variables and Probability Distributions’. The treatment here seems simpler, however, than in his earlier
tract and there is a more practical flavour to his exposition. This part while still purely mathematical
begins to introduce distributions and ideas which are familiar to the statistician.

The title of the third part is ‘Statistical Inference’ and the main outline is that of small sample theory
developed during the past twenty-five years. The illustrations are numerical as well as mathematical
and an attempt is made to show the student the numerical applications of the processes through which
his mathematical theory leads him. The treatment is not exhaustive but the student who has assimilated
this part will have little difficulty in extending his knowledge by further reading.

As a textbook of mathematical statistics this book will remain unrivalled for many years to come. The
mathematical exposition is clear, the development of ideas logical throughout, and the theorems are
presented in a very general way. Any student of mathematics who wishes to get a picture of what
statistical theory is about will be led inevitably to a study of this book. To those who wish to become statisticians
it will be necessary to supplement the reading by a practical course in which the mathematical tools
are tried out on numerical examples. This aspect of statistical work the book does not cover, but it is
obvious that this would be the case from the title. It only remains to say to the student ‘This is a good
book, buy it!’

F. N. DAVID


CORRIGENDA

(Biometrika, 34, 176-7)

In J. Wishart’s paper on ‘The cumulants of the z and of the logarithmic $\chi^2$ and t distributions’,
the following correction should be made:

p. 176, 1st line of section 3: read ‘log |t|’ for ‘log t’, in two places.
p. 177, 1st line following equation (32): read ‘log |x|’ for ‘log x’.